gitignore proposal
[bsc-thesis1415.git] / softwaredesign / workflow.txt
1 The program consist of three modules named:
2 - Converter: hypconvert
3 - Crawler: hypcrawl
4 - Frontend: hypfront
5
6 The frontend consists of submodules for different data sources. For example
7 rss, html, email.
8 The frontend guides the user with categorizing data fields in the source and
9 outputs it in a database which can be read by the crawler and the converter.
10
11 The crawler periodically collects the data specified by the frontend and passes
12 it through to the converter for processing.
13
14 The converter processes the data aquired from the crawler with the patterns and
15 rules given by the frontend and outputs the structured data in the specified
16 xml format.