Here’s a question for the #hivemind : What do people use to do server-side processing of html? I generally use…

Here’s a question for the #hivemind : What do people use to do server-side processing of html? I generally use BeautifulSoup with Python, but I’m about to start a new project and am curious to see what else is out there. Java, Python, and Go are potential language choices for me (most of this will run on Google Appengine); Any suggestions?

7 thoughts on “Here’s a question for the #hivemind : What do people use to do server-side processing of html? I generally use…

  1. Okay, so you’re parsing real HTML. In that case you want lxml (on Python) which is based on libxml2.

    Also, Python 3 has HTMLParser as part of the standard library (and is based on BeautifulSoup).

    Like

  2. Not really, but it doesn’t have much takeup that I know of outside of GAE. I run into it here and there and it’s interesting, so I’m not against using it for something small to see how it, you know, goes.

    Like

Leave a comment