Content Acquisition Services

The extensible Web Retrieval Toolkit (eWRT) has been extended within the DecarboNet project to support the consortium in various data collection tasks. In addition to retrieving data from social networks (such as Facebook or Twitter), it provides helper classes for efficient caching and data management. To get started, download the latest version from GitHub and read the documentation at Read the Docs.
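
To illustrate what a caching helper for retrieval tasks can look like, here is a minimal sketch of a disk-backed cache. It is not eWRT's actual API: the class name DiskCache, the method fetch, and the stand-in function fake_retrieve are all hypothetical and chosen only for this example. Please consult the eWRT documentation for the real classes.

```python
import hashlib
import pickle
import tempfile
from pathlib import Path


class DiskCache:
    """Hypothetical helper (not eWRT's API): stores retrieval results on disk."""

    def __init__(self, cache_dir):
        self.cache_dir = Path(cache_dir)
        self.cache_dir.mkdir(parents=True, exist_ok=True)

    def _path_for(self, key):
        # Hash the key so that URLs of any length map to a safe file name.
        digest = hashlib.sha1(key.encode("utf-8")).hexdigest()
        return self.cache_dir / digest

    def fetch(self, key, retrieve):
        """Return the cached result for key, calling retrieve(key) on a miss."""
        path = self._path_for(key)
        if path.exists():
            with path.open("rb") as f:
                return pickle.load(f)
        result = retrieve(key)
        with path.open("wb") as f:
            pickle.dump(result, f)
        return result


# Stand-in for a real network call; records how often it is invoked.
calls = []


def fake_retrieve(url):
    calls.append(url)
    return "content of " + url


cache = DiskCache(tempfile.mkdtemp())
first = cache.fetch("http://example.org/page", fake_retrieve)
second = cache.fetch("http://example.org/page", fake_retrieve)
```

In this sketch the second call is served from disk, so the retrieval function runs only once. Note that pickle should only be used to load cache files that the application itself has written.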

The webLyzard_api is another central building block that helped to connect the researchers of the consortium. It provides a client for Recognyze (named entity recognition and resolution) and a class for parsing its XML format.