05f5f49307275b71d6a054e77f656f29e02d9bbc
[bsc-thesis1415.git] / thesis2 / abstract.tex
1 Within the leisure activity field, information is often bundled badly and
2 contains empty or wrong data. Hyperleap tries to solve this problem by bundling
3 the information from various sources including RSS feeds. Currently the
4 feedback loop for fixing site-specific crawlers requires multiple steps of which
5 multiple steps demand someone with a computer science background to perform. We
6 introduce a new adaptable crawler generation system using substring matching via
7 an adapted form of directed acyclic word graphs. The application allows users
8 with no particular computer science background to create, edit and test
9 crawlers for RSS feeds. In this way the feedback loop for broken crawlers is
10 shortened, new sources can be incorporated in the database quicker and, most
11 importantly, the information about the latest movie show, theater production or
12 conference will reach the people looking for it as fast as possible.