AI & ML

Last updated Jun 2026

Project Q&A

Overview

Orbit is a personal relationship knowledge graph built from my own iMessage history. It pairs two MCP servers — one for reading messages, one for the knowledge graph — with a Next.js + Cytoscape frontend. An MCP-capable LLM client runs the seed and update prompts against the local servers, then I browse the results in the browser (or an Electron window). The whole thing is local-only and macOS-only; nothing leaves my machine.

Problem Solved

iMessage holds years of relationship signal — who I actually talk to, who I've drifted from, when conversations turned warm or cold — but the Messages app shows only the most recent thread. Orbit turns that latent history into a queryable graph and a set of dashboards, so I can spot ghosts, recognize who chases whom, and see a real year-in-review of my conversations.

Target Users

Me, primarily — a single-user tool that runs on my Mac against my own message store.
Curious engineers — the codebase is a worked example of building a real app on top of MCP servers, with the integration logic encoded as Markdown prompts instead of a custom backend.

Key Features

Tagged-observation graph schema

Each Person entity carries [freq], [topic], [tone], [bio], and optional [sent] observations. Updates replace tagged lines instead of appending, so the graph stays clean across many refreshes.

Network visualization

Cytoscape renders the people graph; Louvain community detection assigns colors to clusters so I can see friend groups at a glance.

Per-person deep dive (`/person/[name]`)

Frequency, rhythm fingerprint, topic cloud, sentiment arc, response-time stats, attachment gallery, and message timeline for any contact.

Drift detection

/ghosts surfaces people I've lost touch with; /initiation shows who chases whom; /responsiveness shows reply latency; /hygiene shows unnamed handles with AddressBook candidate names.

Wrapped slideshow

A Spotify-Wrapped-style year-in-review of conversations, sentiment, top contacts, busiest day, and hour fingerprint.

Electron wrapper

Optional native window with an in-app Refresh button that runs the update prompts in the background via the local LLM client.

Technical Highlights

MCP servers as the integration layer

Both iMessage SQLite parsing and a knowledge-graph store already exist as MCP servers (mac-messages-mcp and @modelcontextprotocol/server-memory). Wiring them into .mcp.json and encoding the workflow in Markdown prompts meant I never had to write or maintain integration code — the prompts are the application logic for the seeding pipeline.

Tag-replace semantics for idempotent updates

Any incremental ingestion has to avoid duplicating data. The convention "each Person has exactly one [freq] line, one [topic] line, etc., and updates delete-then-add the entire line" makes the update loop trivially safe. The frontend parses with a one-liner: obs.find(o => o.startsWith("[topic]")). No schema, no migrations, no merge conflicts.

Reading SQLite from server components

Some views (gallery, per-person message bodies, attachment IDs) need raw data the MCP server doesn't expose. Opening chat.db directly with better-sqlite3 in a server component, marked server-only, gives those pages full access without exposing anything to the browser bundle. The Next.js request-memoization layer means each route opens the DB at most once per request.

Entity disambiguation via the `/hygiene` view

The same person can show up under multiple handles (phone, email, iMessage IDs). Automatic merging either over-merges (similar names, different people) or under-merges (same person, different handle). /hygiene lists unnamed handles next to AddressBook candidate names so I can confirm-and-merge by hand, which is right far more often than any heuristic I tried.

One design substrate across ~18 routes

Every page combines charts, filters, and tables, which drifts fast if each is styled independently. The frontend sits on a token-based theme (src/lib/theme/) and a library of presentational primitives (src/components/ui/) — mastheads, filter bars, sortable column headers, stat grids, the node selection panel — so each route is a thin composition rather than bespoke markup, and a restyle is a token edit. Interactivity (column sorting via a small useSortableRows hook, network layout/size filters, node selection) lives in client wrappers, while the data-loading pages stay server components that read SQLite directly and pass already-rendered content (like a sidebar aside) down as props, keeping the client bundle lean.

Engineering Decisions

Two MCP servers instead of a custom integration

Constraint: Needed iMessage reads and a knowledge-graph store.
Options: Write a custom Node/TS backend for both; or wire existing MCP servers.
Choice: Wire mac-messages-mcp and @modelcontextprotocol/server-memory via .mcp.json.
Why: Keeps the repo focused on the schema convention and visualization. No custom server code to maintain.

JSONL knowledge graph, not a database

Constraint: Persistent store the memory MCP server already writes to, that the frontend can re-read.
Options: SQLite, Postgres, embedded graph DB, raw JSONL.
Choice: Use the memory server's native memory.jsonl file directly from the Next.js loader.
Why: Zero schema migrations, easy to inspect by hand, append-only writes match the incremental flow. Full re-parse on every request is acceptable for ~100 contacts.

Tagged observations with replace-the-line semantics

Constraint: Updates needed to refresh stats without endlessly appending duplicate observations.
Options: Numeric versioning per observation; full entity replacement; tagged-line replacement.
Choice: One [freq], one [topic], one [tone], one [bio], optional [sent] per Person — updates delete the old tag and add the new line.
Why: Predictable parsing in graph.ts, idempotent updates, human-readable in the JSONL.

Read SQLite directly from the frontend

Constraint: Some views need raw message data the MCP server doesn't expose.
Options: Extend the MCP server, build a separate API, or open chat.db directly in server components.
Choice: Open chat.db and AddressBook read-only with better-sqlite3, marked server-only.
Why: Avoids tunneling everything through the MCP layer; trade-off is tight coupling to macOS file paths.

Electron wrapper as opt-in, not the default

Constraint: Native window is nicer for daily use, but adds a heavy dependency.
Options: Browser-only, Electron-only, or both with one as default.
Choice: Keep electron in devDependencies; the browser flow (npm run dev) remains canonical and npm run app is opt-in.
Why: Contributors can run the app without Electron; the wrapper earns its weight only for the in-app Refresh button.

Frequently Asked Questions

How is the graph actually built?

By an MCP-capable LLM client. The user pastes prompts/bootstrap.md into a session opened in this directory. The agent reads messages via the messages MCP server, extracts topics/tone/bio per contact, and writes entities + relations via the memory MCP server. The memory server appends to memory.jsonl, which the Next.js app then re-reads on every page render.

Why MCP servers instead of just a script?

Two reasons. First, the integrations already existed as MCP servers — no need to reimplement them. Second, the seeding logic involves real judgment (which threads are transactional? what's a good one-line tone rationale?). Putting it in a prompt means a reasoning model does that work every run, rather than me trying to encode rules in code.

Why not use a real database for the graph?

For personal-scale data (~100 contacts), parsing a JSONL file on every page render is fast enough and the operational simplicity is worth it. No migrations, no ORM, easy to inspect by hand, and the MCP server already writes to that format.

How does sentiment scoring work?

prompts/sentiment.md scores each top contact per month (−1 to +1, with a confidence and short rationale) and appends JSONL rows to viz/data/sentiment.jsonl. The score is also written back to the Person entity as a [sent] observation. The /sentiment page renders each contact as a diverging bar (warm to the right, strained to the left) with sortable columns; the per-person view renders the time series as an arc.

What happens to personal data?

Everything stays local. viz/data/sentiment.jsonl and viz/data/entity_handles.json contain real names and sentiment rationales, so they're gitignored. The published repo has only source code, prompts, and conventions — no actual messages, contacts, or graph contents.

Why is the Electron wrapper optional?

The browser flow (npm run dev) is the canonical path. Electron adds a heavy dependency and only earns its weight if you want the in-app Refresh button, which shells out to the LLM client headlessly against the update prompts. For most use it's overkill.

How does Orbit handle group chats?

Group threads get their own page (/groups) with member counts and activity, and per-thread drilldowns. The graph treats group-chat-only contacts differently from 1:1 contacts so they don't drown out the people you actually talk to directly.

Technology Stack

Core Technologies

Category	Technology	Version	Purpose
Language	TypeScript	^5	Type-safe app code across server + client
Web framework	Next.js	16.2.6 (Turbopack)	App-router pages, request-memoized data loading, server components for SQLite reads
UI runtime	React	19.2.4	Server + client components
Styling	Tailwind CSS	v4 (+ `@tailwindcss/postcss`)	Utility-first styling over a token-based theme (`src/lib/theme/`) and shared UI primitives (`src/components/ui/`)
Graph rendering	Cytoscape	^3.33	Network view at `/network` with selectable layout (force-directed / concentric) and node sizing
Data table	TanStack Table	^8.21	Powers the full sortable + filterable data table at `/table`
List sorting	custom `useSortableRows` hook	—	Client-side column sorting for the ranked list pages (`/ghosts`, `/sentiment`, `/responsiveness`, `/initiation`, …)
Graph math	graphology + graphology-communities-louvain	^0.26 / ^2.0	Community detection for cluster coloring
SQLite	better-sqlite3	^12.10	Direct read-only access to `chat.db` and AddressBook
Image processing	sharp	^0.34	Attachment thumbnails for `/gallery`

Frontend

Framework: Next.js 16 with the App Router and Turbopack as the default bundler
State management: None — server components fetch, client components receive props; small useState/useMemo islands where interactive (sort controls, network filters, selection panels)
Styling: Tailwind v4 (via postcss.config.mjs) layered over a custom token-based theme in src/lib/theme/ — a single dark "Observatory" palette resolved through CSS custom properties — plus a library of shared presentational primitives in src/components/ui/ so every route shares one visual language and interaction pattern
Build tool: Turbopack (replaces Webpack; ~260ms cold start observed)
Fonts: IBM Plex Sans + IBM Plex Mono via next/font/google

Backend / Data Layer

Runtime: Node.js 20+ (per @types/node: ^20)
Data sources:
- ~/Library/Messages/chat.db — iMessage SQLite database (read-only)
- macOS AddressBook SQLite (read-only) for contact-name fallback
- memory.jsonl from the server-memory MCP for the knowledge graph
- viz/data/* — generated artifacts (sentiment scores, handle→entity map)
API style: Next.js server components + a single /api/attachment/[id] route for streaming attachment bytes
Authentication: None — local-only single-user tool

MCP Layer

Server	Transport	Source	Role
`messages`	stdio via `uvx`	`mac-messages-mcp` (PyPI)	Read messages, search, find contact, list chats
`memory`	stdio via `npx`	`@modelcontextprotocol/server-memory` (npm)	Knowledge graph store — entities, relations, observations

Both registered in .mcp.json at project scope; an MCP-capable LLM client auto-loads them when started in this directory.

Desktop Wrapper (optional)

Framework: Electron ^42.2
Architecture: electron/main.js spawns Next as a child process on port 3737, waits for the URL to respond, opens a BrowserWindow. electron/preload.js exposes an IPC bridge for the in-app Refresh button to invoke the local LLM client headlessly against prompts/update.md + prompts/sentiment.md.

Infrastructure

Hosting: N/A — runs entirely on the user's local Mac
CI/CD: None set up
Monitoring: None — manual inspection of memory.jsonl and the UI

Development Tools

Package manager: npm
Linting: ESLint 9 with eslint-config-next
Formatting: None enforced (no Prettier config in repo)
Testing: node --test with tsx for *.test.ts files — pure logic is factored out so it can be tested without the framework (e.g. src/lib/sort-rows.test.ts, src/lib/sentiment.test.ts, src/components/ui/ToneDot.test.ts)
Type checking: npx tsc --noEmit (no separate typecheck script)
Concurrency utility: concurrently + wait-on available for orchestrating Electron + Next together

Key Dependencies

Package	Purpose
`next`	Framework — App Router, server components, Turbopack
`react` / `react-dom`	UI runtime
`cytoscape`	Force-directed network rendering on `/network`
`graphology` + `graphology-communities-louvain`	In-memory graph + Louvain community detection
`@tanstack/react-table`	Headless sort/filter for the full data table at `/table`
`better-sqlite3`	Synchronous SQLite access to `chat.db` + AddressBook
`sharp`	On-the-fly attachment thumbnail generation
`electron`	Optional native desktop wrapper
`tsx`	TypeScript loader for `node --test` and CLI smoke scripts
`tailwindcss` + `@tailwindcss/postcss`	Styling

Conventions Worth Knowing

No state library. Data flows top-down via server components; interactivity is local.
memory.jsonl is the source of truth for the relationship graph; treat it as append-only and use delete_observations + add_observations to update tagged lines (never duplicate).
viz/data/ is gitignored — those files contain personal contact names and sentiment scores. Don't commit them.
Path constants in viz/src/lib/sqlite.ts and viz/src/lib/graph.ts are macOS- and machine-specific (AddressBook UUID, npx cache hash). Update if cloning to another machine.

Architecture Overview

System Diagram

flowchart TD
    subgraph User["Driver"]
        Agent["MCP-capable LLM client<br/>(reads .mcp.json)"]
    end

    subgraph macOS["macOS data sources (read-only)"]
        ChatDB[("~/Library/Messages/chat.db")]
        AddrBook[("AddressBook DB")]
    end

    subgraph MCP["MCP servers (stdio)"]
        Messages["messages<br/>mac-messages-mcp"]
        Memory["memory<br/>@modelcontextprotocol/server-memory"]
    end

    subgraph Storage["Generated state"]
        MemFile[("memory.jsonl<br/>knowledge graph")]
        SentFile[("viz/data/sentiment.jsonl")]
        HandlesFile[("viz/data/entity_handles.json")]
    end

    subgraph Viz["viz/ — Next.js 16 app"]
        Loader["src/lib/graph.ts<br/>loadGraph()"]
        SQL["src/lib/sqlite.ts<br/>chat.db + AddressBook readers"]
        Pages["App-router pages<br/>/network /table /heatmap<br/>/sentiment /wrapped /person …"]
        UI["Cytoscape + TanStack Table<br/>+ custom charts"]
    end

    subgraph Desktop["Optional desktop wrapper"]
        Electron["Electron main<br/>spawns Next + LLM client"]
    end

    Agent -- "tool_get_recent_messages" --> Messages
    Agent -- "create_entities / add_observations" --> Memory
    Messages -- "SQL" --> ChatDB
    Memory -- "append JSONL" --> MemFile
    Agent -- "writes sentiment scores" --> SentFile
    Agent -- "writes handle map" --> HandlesFile

    Loader --> MemFile
    SQL --> ChatDB
    SQL --> AddrBook
    Pages --> Loader
    Pages --> SQL
    Pages --> SentFile
    Pages --> HandlesFile
    Pages --> UI

    Electron --> Pages
    Electron -. "Refresh button<br/>runs LLM client headless" .-> Agent

Component Descriptions

MCP layer (`.mcp.json`)

Purpose: Expose iMessage and the knowledge graph as tools that an MCP-capable LLM client can call directly.
Location: /.mcp.json (project scope, auto-loaded by the LLM client when started in this directory).
Key responsibilities: Declares two stdio MCP servers — messages (via uvx mac-messages-mcp) and memory (via npx @modelcontextprotocol/server-memory). No env vars or secrets.

Prompt-driven workflow (`prompts/`)

Purpose: Encode the multi-phase workflows for seeding, updating, scoring sentiment, and querying — to be pasted into the LLM client.
Location: /prompts/{bootstrap,update,sentiment,query}.md.
Key responsibilities: Enforce the tagged-observation convention so the graph stays parseable. Phase-gate each run so the user can confirm/correct before writes.

Memory server data file

Purpose: Persistent store for the knowledge graph (entities + relations).
Location: ~/.npm/_npx/<hash>/node_modules/@modelcontextprotocol/server-memory/dist/memory.jsonl (resolved in viz/src/lib/graph.ts:7-10).
Format: Newline-delimited JSON. Each line is either {type:"entity", name, entityType, observations[]} or {type:"relation", from, to, relationType}.
Why JSONL: Append-friendly for the MCP server's incremental writes, easy to grep, no schema migrations.

Graph loader (`viz/src/lib/graph.ts`)

Purpose: Parse memory.jsonl into typed nodes/edges and decorate with Louvain community IDs.
Key responsibilities: Tag parsing ([freq], [topic], [tone], [bio], [sent]), community detection via graphology-communities-louvain, color assignment. Memoized per request by Next.js.

SQLite readers (`viz/src/lib/sqlite.ts`)

Purpose: Read iMessage chat.db and macOS AddressBook directly via better-sqlite3 (readonly).
Key responsibilities: Lazy DB open with caching, phone-number normalization. Used by pages that need raw message bodies, attachments, or contact-name fallback (e.g., /person/[name], /gallery, /wrapped).

Frontend pages (`viz/src/app/*`)

Purpose: Each route is a focused lens onto the graph.
Key responsibilities: /network renders Cytoscape; /table, /groups, /sentiment, /hygiene, /responsiveness, /ghosts, /initiation are sortable tables; /heatmap, /calendar, /activity, /onthisday are time-based views; /wrapped is a slideshow; /person/[name] is the per-contact deep dive.

Design substrate (`viz/src/lib/theme/` + `viz/src/components/ui/`)

Purpose: Give every route one visual language and interaction model instead of per-page styling.
Key responsibilities: A token set (colors, spacing, accent glow) resolved to CSS custom properties plus a theme resolver in src/lib/theme/; a library of presentational primitives in src/components/ui/ (page mastheads, filter bars, sortable column headers, stat grids, aside blocks, the node selection panel, sparklines). Pages compose these primitives, so a visual change is a token edit rather than an N-page sweep.

Electron wrapper (`viz/electron/`)

Purpose: Optional native window plus an IPC endpoint that runs prompts/update.md and prompts/sentiment.md headlessly via the local LLM client.
Key responsibilities: Starts Next.js dev or built server on port 3737, waits for the URL to respond, opens a BrowserWindow. The preload script exposes a Refresh button that triggers the backend prompts without leaving the app.

Data Flow

Seed (one-time) — User pastes prompts/bootstrap.md into the LLM client. The agent calls tool_get_chats and tool_get_recent_messages against the messages MCP server, computes per-contact stats locally, and writes Person entities + relations to the memory MCP server. Memory appends to memory.jsonl.
Update (periodic) — prompts/update.md re-runs the extraction for new messages since the last last_contacted, then uses delete_observations + add_observations to replace tagged lines (not append duplicates).
Sentiment (weekly) — prompts/sentiment.md scores each top contact per month, appends a JSONL row to viz/data/sentiment.jsonl, and writes a [sent] observation to the Person entity.
View — User runs npm run dev (or npm run app). Each route calls loadGraph() (memoized per request) plus any SQLite reads it needs. Cytoscape renders the network; tables sort client-side; charts compute over the loaded graph.
Refresh from the app — In Electron, clicking the Refresh button IPC-calls main.js, which runs the LLM client headlessly with the contents of update.md and sentiment.md piped in, then waits for completion.

External Integrations

Service	Purpose	Documentation
`mac-messages-mcp`	MCP server exposing iMessage read tools	pypi.org/project/mac-messages-mcp
`@modelcontextprotocol/server-memory`	MCP server providing the knowledge-graph store	npm/server-memory
macOS Messages (`chat.db`)	Source-of-truth for message history	Apple-internal SQLite schema
macOS AddressBook	Contact-name resolution for unknown handles	Apple-internal SQLite schema
Local MCP-capable LLM client	Runs the prompts; called headlessly by the Electron Refresh button	Any agent that loads `.mcp.json`

Key Architectural Decisions

Two MCP servers instead of one custom integration

Context: Both iMessage reads and the knowledge graph already had off-the-shelf MCP servers.
Decision: Wire mac-messages-mcp and @modelcontextprotocol/server-memory via .mcp.json and write project-specific logic as Markdown prompts.
Rationale: Keeps the repo focused on the schema convention and visualization; no custom server code to maintain. The prompts encode the workflow, the conventions encode the schema.

JSONL knowledge graph, not a database

Context: Needed a persistent store the memory MCP server already writes to, and that the frontend can re-read.
Decision: Use the memory server's native memory.jsonl file directly from the Next.js loader.
Rationale: Zero schema migrations, easy to inspect by hand, append-only writes match the incremental update flow. Trade-off: full re-parse on every request (acceptable for personal-scale data).

Tagged observations with replace-the-whole-line semantics

Context: Updates needed to refresh stats without endlessly appending duplicate observations.
Decision: Each Person entity carries exactly one [freq] …, one [topic] …, one [tone] …, one [bio] …, and optionally one [sent] … line. Updates delete_observations for the tag and add_observations the new value.
Rationale: Predictable parsing in graph.ts (just .find(o => o.startsWith("[topic]"))); idempotent updates; human-readable in the JSONL.

Read SQLite directly from the frontend

Context: Some views (gallery thumbnails, per-person message body, attachment IDs) need raw message data the MCP server doesn't expose.
Decision: Open chat.db and AddressBook DBs read-only with better-sqlite3 in server components.
Rationale: Direct path avoids tunneling everything through the MCP layer. Marked server-only so it never ships to the browser. Trade-off: tightly couples the app to macOS file paths.

Shared design substrate over per-page styling

Context: ~18 routes each combine charts, tables, and filters; styling them independently drifts quickly and is hard to keep consistent.
Decision: Centralize a token-based theme (src/lib/theme/) and a primitives library (src/components/ui/); each route is a thin composition of primitives, with interactivity (sorting, network filters, selection) isolated in small "use client" wrappers while data loading stays in server components.
Rationale: One palette and one set of interaction patterns across every page; a restyle is a token edit, not a sweep across pages. Keeping interaction in dedicated client wrappers lets the data-loading pages remain server components that read SQLite directly and pass already-rendered content (e.g. an aside) down as props, so the client bundle stays small.

Electron wrapper as opt-in, not the default

Context: A native window is nicer for a personal daily-driver, but adds a heavy dependency.
Decision: Keep electron in devDependencies; npm run app is opt-in. The browser flow (npm run dev) remains the canonical path.
Rationale: Lets contributors run the app without installing Electron; the wrapper exists mainly so the Refresh button can shell out to the LLM client from a single UI surface.

Back to All Projects

Orbit

Project Q&A

Overview

Problem Solved

Target Users

Key Features

Tagged-observation graph schema

Network visualization

Per-person deep dive (/person/[name])

Drift detection

Wrapped slideshow

Electron wrapper

Technical Highlights

MCP servers as the integration layer

Tag-replace semantics for idempotent updates

Reading SQLite from server components

Entity disambiguation via the /hygiene view

One design substrate across ~18 routes

Engineering Decisions

Two MCP servers instead of a custom integration

JSONL knowledge graph, not a database

Tagged observations with replace-the-line semantics

Read SQLite directly from the frontend

Electron wrapper as opt-in, not the default

Frequently Asked Questions

How is the graph actually built?

Why MCP servers instead of just a script?

Why not use a real database for the graph?

How does sentiment scoring work?

What happens to personal data?

Why is the Electron wrapper optional?

How does Orbit handle group chats?

Technology Stack

Core Technologies

Frontend

Backend / Data Layer

MCP Layer

Desktop Wrapper (optional)

Infrastructure

Development Tools

Key Dependencies

Conventions Worth Knowing

Architecture Overview

System Diagram

Component Descriptions

MCP layer (.mcp.json)

Prompt-driven workflow (prompts/)

Memory server data file

Graph loader (viz/src/lib/graph.ts)

SQLite readers (viz/src/lib/sqlite.ts)

Frontend pages (viz/src/app/*)

Design substrate (viz/src/lib/theme/ + viz/src/components/ui/)

Electron wrapper (viz/electron/)

Data Flow

External Integrations

Key Architectural Decisions

Two MCP servers instead of one custom integration

JSONL knowledge graph, not a database

Tagged observations with replace-the-whole-line semantics

Read SQLite directly from the frontend

Shared design substrate over per-page styling

Electron wrapper as opt-in, not the default

Per-person deep dive (`/person/[name]`)

Entity disambiguation via the `/hygiene` view

MCP layer (`.mcp.json`)

Prompt-driven workflow (`prompts/`)

Graph loader (`viz/src/lib/graph.ts`)

SQLite readers (`viz/src/lib/sqlite.ts`)

Frontend pages (`viz/src/app/*`)

Design substrate (`viz/src/lib/theme/` + `viz/src/components/ui/`)

Electron wrapper (`viz/electron/`)