Feedparser

📅 27 Feb 2021

Did you know python has a pretty handy library for RSS feed parsing?

I didn’t. I only recently discovered it while creating my third website.

My new website is plugging along just fine. I recently made the decision to use PHP in addition to HTML and CSS because I wanted some additional functionality. Specifically, I plan to update my new website with movie reviews and lists of movies with my favorite actors/actresses.

So, how do I do that?

Well, I knew that I wanted to use python for part of it. I probably could have used PHP, but I don’t know enough yet and I try my best to keep my python proficiency up.

As I started working in python, I was using both beautiful soup and regular expressions, but it just seemed cumbersome.

A quick web-search later and I discovered feedparser. Here is what it looks like:


  from bs4 import BeautifulSoup as bs
  import feedparser

  feed = feedparser.parse('https://letterboxd.com/rmoore35/rss/')

  # Top Three Films
  for key in feed.entries[0:3]:
      tags = bs(key.description, "lxml").find_all('p',limit=10)
      lst = []
      for tag in tags[1:]:
          lst.append(tag.string)

      s = ' '.join(map(str, lst)).rstrip().replace("\'", "")
      new_title = key.title.split(",")
      top3_dict = {"title":new_title[0],"link": key.link,"description":s}
      for key,value in top3_dict.items():
           file1.write(value + "\n")

Note: This is kinda messy right now. I write things very rough at first, then I condense them down. I hope I’m not alone in this :)

Another note: I ended up using BeautifulSoup for a little bit of the code, as feedparser didn’t seem to handle the description tag very well, I think due to the way the Letterboxd rss feed is formatted. Since I won’t be using this code for anything else I wasn’t that worried about making it one-size-fits-all.

But what that little snippet does is finds the title of my three favorite films in the feed, as well as the url and the description of my review. I then write each on a new line in a txt file.

I also use very similar code to pull lists of every film I have watched for my favorite actresses/actors.

The next step will be to read that txt file with PHP and add that to the html for my site. Exciting stuff.

Day 33: #100DaysToOffload

💬 Looking for comments?

I don't have comments on this site because I don't feel like managing them.

Instead of leaving a comment, feel free to ✉️ contact me instead.