Friday, January 18, 2008

Parsing HTML-code

I recently went through som old bookmarks and came across a .NET component (Html Agility Pack) used for parsing malformd HTML just like it was XML. This HTML parser builds a read / write DOM document and supports XPATH and XSLT. Use this assembly if you want to:

  1. Fix or generate pages

  2. Build a web scanner. You can easy get a list of eg all img tags in the document.

  3. Build a web scraper.


The assembly is very easy to use and works great! You find it here...

Friday, January 11, 2008

Shell.NET

I recently went through som old bookmarks and came across a library for .NET with a lot of smart functions. I was looking around for a grep utility, because I needed a fast way to search some files. Grep is just one of the components in this library. It's compiled as standalone command line applications, but it's easy to use in your own command line and winforms applications. The most common applications like Grep, TextReplace, TextTabs, SameFiles, SameDirs, etc, is also compiled into a user friendly windows version.

Shell.NET is a set of more than 65 console and winform applications written in C#. It features some classical Unix applications like basename, cal, grep, tr, uniq, wc, but also some very unique tools like Barcode, RunScript or TextTransform, a awk-like text processing application accepting scripts written in any of the languages supported by the installed .NET framework.

Take a look at the project, download it, and try to use it! I found it very useful...