The evidence collected by the project supporters might be suitable for reuse.
We cannot allow ourselves to:
- allow or enable social media intelligence;
- provide an API that could help a malicious third party use the fbtrex data for additional user profiling.
The data must meet these requirements:
- the content was meant to be public by its original author
- it is unlinked from the supporter whose unique observations of the social media contributed it
- it contains at least one topic of public interest
- it is searchable only by topics of public interest
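As an illustration only, the requirements above could be enforced with a small filter like the one below. This is a minimal sketch, not the fbtrex implementation: the `Observation` and `ReusableRecord` types, their field names, and the helper functions are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Observation:
    """A post as seen by one supporter's browser (hypothetical fields)."""
    supporter_id: str          # identifies the supporter; must never be published
    post_id: str
    is_public: bool            # True only if the author published the post publicly
    text: str
    topics: Tuple[str, ...] = ()


@dataclass(frozen=True)
class ReusableRecord:
    """What may be exposed for reuse: note there is no supporter field."""
    post_id: str
    text: str
    topics: Tuple[str, ...]


def to_reusable(obs: Observation) -> Optional[ReusableRecord]:
    """Return a publishable record, or None if a requirement is not met."""
    if not obs.is_public:      # requirement: meant to be public by the author
        return None
    if not obs.topics:         # requirement: at least one topic of public interest
        return None
    # requirement: unlinked from the supporter (supporter_id is dropped here)
    return ReusableRecord(obs.post_id, obs.text, obs.topics)


def search_by_topic(records: List[ReusableRecord], topic: str) -> List[ReusableRecord]:
    """Requirement: the only supported lookup key is a topic."""
    return [r for r in records if topic in r.topics]
```

Because `ReusableRecord` has no supporter field and `search_by_topic` is the only lookup, a query by user cannot be expressed against this data at all, which is the property the requirements aim for.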
To address points 2 and 3, we assume that every entry in Wikipedia is a topic of public interest. (This helps distribute responsibility instead of centralizing it, and relies on collective intelligence.) For every observed public post, from every browser with the extension installed, the text (if any, and if it is longer than 140 characters) is semantically analyzed. This allows anyone to subscribe to an RSS feed by following the available semantic topics. It is a preliminary example of a basic algorithm.
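The flow described above can be sketched roughly as follows. The text does not specify how the semantic analysis works, so this sketch swaps in a naive case-insensitive match against a tiny, hard-coded set of Wikipedia entry titles. The title set, the function names, and the `(post_id, text)` input shape are assumptions made for illustration; only the 140-character threshold comes from the text.

```python
from collections import defaultdict

MIN_CHARS = 140  # posts must be longer than this to be analyzed

# Assumption: a tiny stand-in for the set of Wikipedia entry titles.
WIKIPEDIA_TITLES = {"Climate change", "Net neutrality"}


def eligible(text):
    """True if the post has text and it is longer than 140 characters."""
    return text is not None and len(text) > MIN_CHARS


def extract_topics(text, titles=WIKIPEDIA_TITLES):
    """Naive stand-in for semantic analysis: substring match on titles."""
    lowered = text.lower()
    return sorted(t for t in titles if t.lower() in lowered)


def build_topic_feeds(posts):
    """Group eligible post ids by topic; each key could back one RSS feed."""
    feeds = defaultdict(list)
    for post_id, text in posts:
        if not eligible(text):
            continue
        for topic in extract_topics(text):
            feeds[topic].append(post_id)
    return dict(feeds)
```

Each key of the returned dictionary corresponds to one topic a reader could follow. Posts that are too short, have no text, or match no title never appear in any feed.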
Further reading
- The public API format documentation, originally in this GitHub issue and since ported to our API documentation page.
- The project status 2018, which explains our challenge and process.