Memoss

Memory that grows naturally, like moss.

Your file system is your knowledge base. Agents compile, maintain, and cross-reference it continuously.

中文文档

What is Memoss?

Memoss is an agent-native knowledge runtime — a system where LLM agents continuously build and maintain a living knowledge base from your raw sources (documents, web pages, data schemas, code, conversations), stored as plain Markdown files on your file system.

Unlike traditional wikis where humans do the bookkeeping (and it goes stale), or RAG systems where knowledge is re-derived from scratch on every query — in Memoss, agents do the maintenance, and knowledge is compiled once, kept current, and always ready.

The Core Idea

You drop sources. Agents build knowledge. You ask questions. Agents keep it alive.

Phase 1 operations:

Ingest — Drop a URL, file, or repo. The agent reads it, extracts knowledge, updates every related page, and commits (to a draft branch for your review).
Query — Ask a question. The agent searches the knowledge base, synthesizes an answer with citations, and can file the answer back as a new page.
Lint — The agent checks for contradictions, stale claims, orphan pages, and missing links. Knowledge stays healthy.
Extract — Convert a source to Markdown under sources/extracted/ using skills.sh skills (used automatically during ingest, or standalone via memoss extract).

Coming in Phase 2: Enrich · Discover · Sync · Publish · Bridge (OKF ↔ enterprise catalog)

Dual-Track Knowledge

Memoss serves two tracks that can be used independently or together:

Track	Format	Best for
Wiki	OKF vault (Markdown + YAML frontmatter)	Personal research, team wikis, reading notes
Catalog	Metadata-as-Code snapshot (`catalog.yaml` + entries)	Data asset documentation, enterprise catalog sync

The Bridge layer converts between them — compatible with Google knowledge-catalog mdcode tooling.

Why Markdown Files?

Because Markdown is the only format that is readable by humans, parseable by agents, and diffable in git — all without tooling. You can cat a file, grep a concept, git diff a change, and open the vault in Obsidian for graph view. No proprietary database. No lock-in.

Quickstart

From source (development)

git clone https://github.com/deeploop-ai/memoss.git
cd memoss
pnpm install
pnpm nx build core cli mcp-server

# Set your LLM API key
export ANTHROPIC_API_KEY=sk-ant-...

# Create a knowledge base (default: ~/.memoss-vault)
node apps/cli/dist/main.js init --pack research

# Or specify a path
node apps/cli/dist/main.js init ./my-knowledge --pack research
cd ./my-knowledge

# Ingest your first source (writes to draft branch)
node apps/cli/dist/main.js ingest "https://example.com/article" --type web

# Review and merge agent changes
node apps/cli/dist/main.js approve

# Ask a question
node apps/cli/dist/main.js query "what does this article say about X?"

# File the answer back into the knowledge base
node apps/cli/dist/main.js query "compare X and Y" --save

# Check knowledge base health
node apps/cli/dist/main.js lint

# Visualize the knowledge graph
node apps/cli/dist/main.js graph

# Start MCP server for other AI agents (agent tools only by default)
node apps/cli/dist/main.js mcp serve
node apps/cli/dist/main.js mcp serve --capabilities agent,read
node apps/cli/dist/main.js mcp serve --capabilities full

Global install

npm install -g @memoss/cli
memoss --version

# Default vault at ~/.memoss-vault
memoss init
memoss ingest "https://example.com/article" --type web
memoss approve
memoss query "what does this article say about X?"
memoss graph
memoss mcp serve

Configuration

Path	Purpose
`~/.memoss-vault`	Default vault when none is specified
`~/.memoss/config.yaml`	User-level shared config (model defaults, merged into vault config)
`MEMOSS_VAULT_PATH`	Override vault location
`MEMOSS_MCP_CAPABILITIES`	MCP tool levels: `agent`, `read`, `write`, or `full`

Vault discovery order: --vault / -C → MEMOSS_VAULT_PATH → walk up from cwd → ~/.memoss-vault.

Ingest guards and policies

memoss ingest runs a pipeline before and during writing pages:

Extract (optional) → Validate (on by default) → Tuning (on by default) → Ingest Agent → draft branch for review

Hard blocks (ingest aborts, vault unchanged)

Validate runs unless you pass --skip-validate. It has two layers:

Heuristic checks (deterministic) — any match rejects immediately:
- Raw HTML page shells instead of article text
- Heavy <head> / <script> scaffolding
- Fewer than ~80 characters of meaningful text
- Replacement characters (U+FFFD) suggesting encoding corruption
- Content resembling HTTP 404/403 error pages
- Broken PDF text extraction (fragmented short lines, vertical CJK splits, etc.)
Validate Agent — LLM review of the same material. Also rejects login walls, CAPTCHA pages, paywall teasers, empty stubs, and nav/footer chrome with little body text. If the agent does not submit a verdict, ingest is aborted as a precaution.

Extract failures can also stop ingest before validate — e.g. audio / video / unknown formats with no installed extraction skill and no built-in fallback.

memoss extract exits when extracted content is flagged needs_manual_review (e.g. broken PDF). memoss ingest only warns and continues.

Soft limits (ingest proceeds, content may be trimmed)

Stage	Blocks ingest?	What it does
Tuning	No	Plans which pages to create/update; `skip_patterns` apply to low-signal sections inside a source (footer nav, changelog indexes), not the whole source. Override with `--emphasis`.
Ingest Agent	No	Skips low-value sections (marketing fluff, nav chrome). Won't skip an entire user-submitted source. New topics get new `topics/` pages even when the vault theme differs. May omit spec-level detail (raw BNF, legacy appendices) in favor of concept pages — check the agent summary Skipped line.
Write policies	On `write_page`	Defaults in `.memoss/config.yaml` → `policies`: must `read_page` before updating; preserve existing `#` headings and `resource` frontmatter (`error`); warn on large body shrink, missing `# Citations`, or meta-page reference slugs.

After ingest

When git.draft_branch is enabled (default), changes land on a draft branch — review with memoss approve or memoss reject. If ingest completes but no pages were created or updated, retry with --emphasis or --skip-tuning.

Useful flags

memoss ingest <source> \
  --skip-validate    # bypass validate gate (use with care)
  --skip-tuning      # skip tuning planning pass
  --no-extract       # ingest source as-is (no extract step)
  --emphasis "..."   # steer what to keep or prioritize
  --no-draft         # write directly to the current branch

Configure extraction skills and trust in .memoss/config.yaml under extraction.*. See Extraction Skills Design.

MCP in Cherry Studio

Use STDIO transport. Put the executable in Command and subcommands in Arguments:

Field	Value
Command	`memoss`
Arguments	`mcp` / `serve`
Environment	`MEMOSS_VAULT_PATH=/path/to/vault`

Agent tools (default)

By default, MCP exposes agent runners only:

Tool	Use when
`run_ingest`	Add a URL, file, or source to the knowledge base (extract + analyze + update pages). Prefer this when the user wants to ingest or save content.
`run_ingest_status`	Poll an async `run_ingest` job by `jobId`.
`run_extract`	Extract-only: write Markdown to `sources/extracted/`. Does not update the wiki. Use only when extraction alone is requested.
`run_query`	Ask questions against the knowledge base. Prefer this over calling `search_kb` directly.
`run_lint`	Check knowledge base health (contradictions, stale pages, broken links).

run_ingest runs extract automatically when needed (extract: auto). For requests like “add this URL to my knowledge base”, the client should call run_ingest, not run_extract.

To expose low-level read/write tools:

memoss mcp serve --capabilities agent,read
memoss mcp serve --capabilities full

Full command reference: docs/cli-reference.md · OKF format: docs/okf-spec.md

Architecture

apps/
└── cli/                    @memoss/cli     init · ingest · query · lint · approve · graph · mcp

packages/
├── core/                   @memoss/core    OKF · agents · tools · policies
├── mcp/                    @memoss/mcp-server
├── catalog-bridge/         @memoss/catalog-bridge   (Phase 2a — MaC sync)
└── search/                 @memoss/search           (Phase 2b — hybrid retrieval)

schema-packs/               personal · research · data-catalog templates

Phase 2 adds apps/web/ (Next.js UI) and apps/desktop/ (Obsidian-compatible vault + agent chat).

Built with:

Agent Engine — Vercel AI SDK (multi-provider, type-safe tool calling)
Monorepo — Nx + pnpm workspaces
Formats — OKF (wiki) + MaC (catalog)
Language — TypeScript

Where This Comes From

Memoss productizes the convergence of two independent designs:

Google knowledge-catalog — OKF format, Metadata-as-Code sync, enrichment/discovery agents, MCP-first architecture.
Karpathy's LLM Wiki Pattern — Three-layer model (sources → wiki → schema), ingest/query/lint, interactive curation, Obsidian workflow.

One enterprise, one personal. Same architecture. The pattern is validated. The category is undefined.

Roadmap

Phase	Timeline	Focus
Phase 1a	0–4 months	CLI + core loop (ingest/query/lint), draft review, MCP, graph viewer
Phase 1b	4–6 months	Web crawl ingest, provenance, lint health score, examples
Phase 2a	0–8 weeks	Serial Read, provenance closure, E2E, image ingest
Phase 2b	8–20 weeks	Local Discover, hybrid search, publish bundle, retrieval eval
Phase 3	18+ months	Hosted platform, bundle marketplace, enterprise

Full details: Product Design v0.2 · Phase 1 Plan · Phase 2 Plan (personal/small teams) · Phase 1 Technical Design · Extraction Skills Design · Serial Read Design · CLI Reference · OKF Spec

Contributing

Memoss is in early development. See CONTRIBUTING.md for contribution guidelines (coming soon).

License

Memoss is open source under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.agents/skills		.agents/skills
.claude		.claude
.codex		.codex
.cursor		.cursor
.gemini		.gemini
.github		.github
.opencode		.opencode
.verdaccio		.verdaccio
.vscode		.vscode
apps/cli		apps/cli
docs		docs
examples/ga4-ecommerce		examples/ga4-ecommerce
packages		packages
schema-packs		schema-packs
scripts		scripts
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
README_ZH.md		README_ZH.md
eslint.config.mjs		eslint.config.mjs
nx.json		nx.json
opencode.json		opencode.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.base.json		tsconfig.base.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Memoss

What is Memoss?

The Core Idea

Dual-Track Knowledge

Why Markdown Files?

Quickstart

From source (development)

Global install

Configuration

Ingest guards and policies

Hard blocks (ingest aborts, vault unchanged)

Soft limits (ingest proceeds, content may be trimmed)

After ingest

Useful flags

MCP in Cherry Studio

Agent tools (default)

Architecture

Where This Comes From

Roadmap

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Memoss

What is Memoss?

The Core Idea

Dual-Track Knowledge

Why Markdown Files?

Quickstart

From source (development)

Global install

Configuration

Ingest guards and policies

Hard blocks (ingest aborts, vault unchanged)

Soft limits (ingest proceeds, content may be trimmed)

After ingest

Useful flags

MCP in Cherry Studio

Agent tools (default)

Architecture

Where This Comes From

Roadmap

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages