The program consists of three modules:
- Converter: hypconvert
- Crawler: hypcrawl
- Frontend: hypfront

The frontend consists of submodules for different data sources, for example RSS, HTML, and email. It guides the user through categorizing the data fields in a source and stores the result in a database that both the crawler and the converter can read.

The crawler periodically collects the data specified by the frontend and passes it to the converter for processing.

The converter processes the data acquired from the crawler using the patterns and rules given by the frontend and outputs the structured data in the specified XML format.
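
The data flow between the three modules can be sketched roughly as follows. This is a minimal illustration and not the actual hypfront, hypcrawl, or hypconvert code. The function names, the `fields` table layout, the use of SQLite, the regular-expression patterns, and the `record` XML element are all assumptions made for the example. The periodic scheduling of the crawler is left out, and the network fetch is passed in as a plain function.

```python
import re
import sqlite3
import xml.etree.ElementTree as ET


def frontend_save(db, source, field_patterns):
    """Frontend step (hypothetical): store the user's field patterns for a source."""
    db.execute(
        "CREATE TABLE IF NOT EXISTS fields (source TEXT, name TEXT, pattern TEXT)"
    )
    db.executemany(
        "INSERT INTO fields VALUES (?, ?, ?)",
        [(source, name, pattern) for name, pattern in field_patterns.items()],
    )
    db.commit()


def crawl(db, fetch):
    """Crawler step (hypothetical): fetch raw data for every source in the database."""
    rows = db.execute("SELECT DISTINCT source FROM fields ORDER BY source")
    sources = [row[0] for row in rows]
    # `fetch` stands in for the real retrieval (HTTP, mailbox, ...).
    return {source: fetch(source) for source in sources}


def convert(db, source, raw):
    """Converter step (hypothetical): apply stored patterns and build an XML record."""
    record = ET.Element("record", {"source": source})
    rows = db.execute(
        "SELECT name, pattern FROM fields WHERE source = ? ORDER BY rowid",
        (source,),
    )
    for name, pattern in rows:
        match = re.search(pattern, raw)
        if match:
            # The first capture group of each pattern is taken as the field value.
            ET.SubElement(record, name).text = match.group(1)
    return ET.tostring(record, encoding="unicode")


# Example run with an in-memory database and a stubbed fetch function.
db = sqlite3.connect(":memory:")
frontend_save(db, "feed", {"title": r"<title>(.*?)</title>", "author": r"by (\w+)"})
raw_data = crawl(db, lambda source: "<title>Hello</title> by alice")
print(convert(db, "feed", raw_data["feed"]))
```

In this sketch the database is the only thing the modules share, which mirrors the description above: the frontend writes the field definitions, and the crawler and converter only read them.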