A tool to screen new arXiv papers
arxiv-digest is a research tool that helps users filter and prioritize new arXiv papers against customizable criteria. It computes relevance scores from keyword matching, author preferences, and optional LLM analysis. The output is an HTML report that ranks the papers and shows each one's score, reasons, and tags for efficient review.
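To make the report format concrete, here is a minimal sketch of how a ranked HTML report could be rendered from already-scored papers. The `ScoredPaper` record, its field names, and the `render_report` function are assumptions made for illustration; the project's actual data model and markup are not shown in this summary.

```python
from dataclasses import dataclass, field
from html import escape


@dataclass
class ScoredPaper:
    # Hypothetical record; field names are illustrative, not the project's schema.
    title: str
    abstract: str
    score: float
    reasons: list[str] = field(default_factory=list)
    tags: list[str] = field(default_factory=list)


def render_report(papers: list[ScoredPaper]) -> str:
    """Return a minimal HTML page listing papers from highest to lowest score."""
    ranked = sorted(papers, key=lambda p: p.score, reverse=True)
    items = []
    for p in ranked:
        # Escape every user-visible string so titles or abstracts
        # containing markup cannot break the page.
        reasons = "; ".join(escape(r) for r in p.reasons)
        tags = ", ".join(escape(t) for t in p.tags)
        items.append(
            "<li>"
            f"<h2>{escape(p.title)} (score {p.score:.1f})</h2>"
            f"<p>{escape(p.abstract)}</p>"
            f"<p>Reasons: {reasons}</p>"
            f"<p>Tags: {tags}</p>"
            "</li>"
        )
    return "<html><body><ol>" + "".join(items) + "</ol></body></html>"
```

Writing the returned string to a local file, for example with `pathlib.Path("report.html").write_text(...)`, would give a page that opens in any browser.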
- The tool retrieves recent arXiv papers and assigns a local relevance score based on keywords, authors, and categories defined in a configuration file.
- A two-stage scoring system first applies deterministic rules and optionally uses an LLM to provide an additional relevance score and filtering decision.
- Users can customize positive and negative keywords, preferred or blocked authors, and adjust scoring weights to tailor the tool to specific research topics.
- The final output is an HTML report listing papers with titles, abstracts, scores, reasons for inclusion, and tags for quick assessment.
- The tool supports topics spanning multiple arXiv categories and is adaptable by editing the config file without requiring code changes.
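The two-stage scoring described in the points above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's implementation: the `Paper` and `Rules` types, the weight values, the `min_local` threshold, and the shape of the `llm_judge` callback are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Paper:
    title: str
    abstract: str
    authors: list[str]
    categories: list[str]


@dataclass
class Rules:
    # All names and default weights here are assumptions for illustration.
    positive_keywords: list[str] = field(default_factory=list)
    negative_keywords: list[str] = field(default_factory=list)
    preferred_authors: list[str] = field(default_factory=list)
    blocked_authors: list[str] = field(default_factory=list)
    categories: list[str] = field(default_factory=list)
    keyword_weight: float = 2.0
    negative_weight: float = 3.0
    author_weight: float = 4.0
    category_weight: float = 1.0


def local_score(paper: Paper, rules: Rules) -> tuple[float, list[str]]:
    """Stage 1: deterministic score plus human-readable reasons."""
    if any(a in rules.blocked_authors for a in paper.authors):
        return float("-inf"), ["blocked author"]
    text = f"{paper.title} {paper.abstract}".lower()
    score, reasons = 0.0, []
    for kw in rules.positive_keywords:
        if kw.lower() in text:
            score += rules.keyword_weight
            reasons.append(f"+keyword: {kw}")
    for kw in rules.negative_keywords:
        if kw.lower() in text:
            score -= rules.negative_weight
            reasons.append(f"-keyword: {kw}")
    for author in paper.authors:
        if author in rules.preferred_authors:
            score += rules.author_weight
            reasons.append(f"+author: {author}")
    if any(c in rules.categories for c in paper.categories):
        score += rules.category_weight
        reasons.append("+category match")
    return score, reasons


# Hypothetical stage-2 callback: returns (extra score, keep decision).
LLMJudge = Callable[[Paper], tuple[float, bool]]


def rank(papers: list[Paper], rules: Rules,
         llm_judge: Optional[LLMJudge] = None, min_local: float = 1.0):
    """Apply stage 1, optionally stage 2, and sort by final score."""
    results = []
    for paper in papers:
        score, reasons = local_score(paper, rules)
        if score < min_local:
            continue  # dropped by the deterministic rules
        if llm_judge is not None:
            llm_score, keep = llm_judge(paper)
            if not keep:
                continue  # dropped by the optional LLM filter
            score += llm_score
            reasons.append(f"LLM score: {llm_score:.1f}")
        results.append((score, paper, reasons))
    results.sort(key=lambda r: r[0], reverse=True)
    return results
```

Running the deterministic stage first means the optional LLM call is only made for papers that already pass the local rules, which keeps the expensive step small. Whether the project combines the two scores by addition, as this sketch does, is an assumption.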
arxiv-digest is a small research tool for researchers who want to keep reading new arXiv papers without manually sorting through the daily volume of submissions. The user defines a research topic in config.toml: arXiv categories, positive keywords, negative keywords, optional author preferences, and an optional LLM instruction. The program retrieves recent arXiv records, computes an explicit relevance score for each paper, and writes a local HTML report ordered for reading. The project is designed for topics that are distributed across several arXiv categories and cannot be captured by one keyword query.
…
The full article is at GitHub.