A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
-
Updated
May 28, 2026
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
SutroYaro — Sutro Group research workspace for energy-efficient AI training. Point any coding agent at the repo and it becomes a research agent. 34 experiments, eval environment, weekly catch-ups, multi-researcher workflow.
🤖 CodeForge AI: An autonomous multi-agent coding system powered by LangGraph for agentic software development and automated workflows. SOTA custom agentic GraphRag, shared-state memory, auto-model routing for cost optimization, and a range of custom tooling.
Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features agentic workflow & source tracking.
LitReview Skill is an installable agent skill for end-to-end literature review generation. It helps agents conduct literature reviews with a well-designed and widely used review framework so the search process is broad, iterative, and less likely to miss relevant articles.
Foundation for an open strong-agent platform: controllers, operators, skills, A2A, runtime, and graph execution.
Lightweight Python CLI for the Exa API (Search, Contents, Find Similar, Answer, Research, Context) with JSON-first output, SSE streaming, and model-aware polling. LLM‑agnostic: integrate with OpenAI Agents SDK/Codex CLI or Claude tool use by invoking CLI commands, no MCP server required.
Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features agentic workflow & source tracking.
Six MCP servers that automate the full academic research pipeline — from refining a vague research question to generating a publication-ready report. Each server handles a distinct stage of the workflow: question development, data processing, code generation, script execut
Curated paper-related AI skills and GitHub repositories for idea discovery, literature search, experiments, writing, citations, LaTeX/DOCX, review, and submission.
🤖 Build and interact with Claude Agent using this Python SDK for seamless integration and efficient asynchronous querying.
Benchmark whether agent skills actually improve research and engineering tasks.
An advanced agentic workflow implementation using LangGraph and LangChain, featuring iterative research, autonomous planning, and persistent state management for high-quality content generation.
Agent skills and workflows for reproducible ML research.
A curated collection of research agents, skill libraries, autonomous research loops, paper-writing pipelines, MCP servers, and benchmarks built around Claude Code, OpenAI Codex CLI, and adjacent coding-agent CLIs for AI/ML research.
Track public autoresearch use cases across industries with a curated list of repos, write-ups, and discussions
Portable AI agent skills and specialist subagents for prompt enhancement, workspace resume, source-grounded research, and release readiness.
this is a tool to use AI agents to help with job applications
Autonomous ML research loops for Claude Code with mechanical anti-fabrication guards.
Add a description, image, and links to the research-agents topic page so that developers can more easily learn about it.
To associate your repository with the research-agents topic, visit your repo's landing page and select "manage topics."