True, false, or just opinions? Maybe not binary, but a percentage.
Fact-checking tools to combat disinformation.
Extract a list of statements from given text.
For each statement search via search engine and read the top URLs.
For each hostname as one source, extract most related info from the read content.
For each sources, generate one verdict and citation with the extracted content.
Combine all verdicts of one statements into a final verdict.
Return a list of statements with verdicts, citations and others related.
Online demo: https://check.ittia.net
Using existing API: https://github.com/ittia-research/check/tree/main/packages/ittia_check
Main components:
- Check server: see docker-compose.yml
- LLM: any OpenAI compatible API, self-hosting via vllm or Ollama
- Embedding: self-hosting via Ollama or Infinity
- Rerank: self-hosting via Infinity
- Search: https://search.ittia.net
- Start a wiki_dpr retrieval server (ColBERTv2) for development: https://github.com/ittia-research/check/tree/main/datasets/wiki_dpr
- Using
search.ittia.net
for better optimization. - Features:
- Customizable source count.
- Supports search sessions: streaming, resuming.
- Utilizes state-of-the-art search engine (currently Google).
Input something.
Analyze percentage of facts and opinions.
Fact-check like what a researcher will do:
- Use search engine as data source and AI as the verdict.
Output analysis:
- MARKDOWN as the default format, JSON optional.
- For new information, doubts as default, fact-check follows.
Input types:
- facts
- opinions
- questions
Verdicts:
- false
- true
- tie: false and true verdicts counts are the same and above zero
- irrelevant: context processed irrelevant to the statement
- https://www.snopes.com
- https://www.bmi.bund.de/SharedDocs/schwerpunkte/EN/disinformation/examples-of-russian-disinformation-and-the-facts.html
- TPU Research Cloud team at Google
- Google Search
- Jina Reader