Full application build: landing page, auth, chat with MCP tool use, harness management, and OAuth by DIodide · Pull Request #8 · DIodide/Harness

DIodide · 2026-03-11T03:56:07Z

Summary

This PR represents the full initial build of Harness, bringing the application from skeleton to a functional AI chat
platform with MCP (Model Context Protocol) integration. Key additions include:

Landing page — Animated landing page with floating dots background, feature highlights, and call-to-action
Authentication & onboarding — Clerk-based sign-in page, onboarding flow with API key setup and initial harness
creation, TanStack Router auth guards for protected routes
Chat interface — Full streaming chat UI with markdown rendering, reasoning/thinking display, tool call
visualization, chat interrupts, auto-scroll, and per-conversation state management
Harness management — Create, configure, and manage harnesses (model + MCP server bundles), harness detail pages,
decoupled harnesses from conversations with last-used tracking
MCP integration — Dynamic HTTP-streamable MCP server connectivity, parallel tool call execution, MCP server health
checks with UI status indicators, and error surfacing to frontend
OAuth for MCP servers — Full OAuth 2.0 flow for MCP servers requiring authentication (GitHub OAuth App support),
token storage and refresh via Convex
FastAPI backend — Chat streaming endpoint via OpenRouter, MCP client service, OAuth routes, Convex persistence
service, structured logging, config validation, connection pooling via httpx.AsyncClient
Convex backend — Schema for harnesses, conversations, messages (with parts, tool calls, usage/cost tracking), MCP
OAuth tokens, and user settings; seed data; composite indexes for recency queries
UI/UX — shadcn/ui component library, display modes (zen/standard/developer), user settings with profile
management, Geist font, toast notifications, dark theme with Clerk appearance customization
Model support — GPT-4o, GPT-4.1, Claude Sonnet/Opus 4, Gemini 2.5 Pro/Flash, DeepSeek R1/V3, Grok 3/3 Mini;
thinking model variants with reasoning parameter support

Changes

81 files changed, ~12,000 insertions, ~670 deletions
Removed deprecated packages/backend in favor of packages/convex-backend
Removed old Header component and Clerk integration files, replaced with route-level auth guards and chromeless route
support

PRs included

Feat/landing page #3 — Landing page
Feat/login and onboarding #4 — Login and onboarding
Feat/initial chat streaming implementation #5 — Initial chat + OAuth implementation
Add chat interrupts, auto scroll #6 — Chat interrupts and scroll
Feat/mcp implementation #7 — MCP implementation

… Header component

…ing - should've been in the staging already

…ng related deps to apps/web

…d per conversation, add usersettings logic

Feat/landing page

Feat/login and onboarding

…nt and user

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

skills.sh implementation

…or switching back to the default.

The skills merge overwrote deployment fixes. This restores: - frontend-cd.yml: use --name flag instead of --env for correct worker targeting - main.py: add harness.nz and workers.dev CORS origins - wrangler.jsonc: add public env vars for both staging and production Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Deployment fixes: CORS, wrangler vars, CD worker targeting

…en creating/editing a Harness

…ness into feat/model_switching

…asis or the entire Harness itself.

Feat/model switching

…laim The frontend was calling getToken() without a template, which returns Clerk's default session token (no email claim). The backend's extract_princeton_netid() needs the email to derive the netid. Fix: use getToken({ template: "convex" }) which includes the email claim configured in the Clerk JWT template. Also add debug logging when netid extraction fails. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Fix: netid not passing to Junction Engine

The JWT email is the user's primary email (e.g. Gmail), not necessarily their Princeton email. When the JWT email isn't @princeton.edu, the backend now calls the Clerk Backend API to check all verified emails on the account. - Add resolve_princeton_netid() async function with Clerk API fallback - Add 5-minute TTL cache so email removals propagate without restart - Requires CLERK_SECRET_KEY env var on the backend Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Fix netid: Clerk API fallback with TTL cache

cole-ramer · 2026-04-01T21:26:30Z

@claude, review this.

claude · 2026-04-01T21:26:50Z

Claude finished @cole-ramer's task in 6m 57s —— View job

PR Review

This is an incremental review on top of the prior Claude review from March 22. I re-read the key files and note what's been fixed, what remains open, and flag new issues introduced by the additions since then (sandbox tools, skills system, etc.).

Tasks

Gather context
Review FastAPI backend (auth, chat, MCP, OAuth)
Review frontend (chat stream, routes, components)
Review Convex schema and queries
Post comprehensive review

What's Been Fixed

Session cache now user-scoped — _session_cache is now keyed by (url, user_id, netid) (mcp_client.py:26), resolving the cross-user session sharing bug from item Feat/initial chat streaming implementation #5 in the prior review.
Hardcoded personal ngrok URL removed — main.py CORS list no longer contains the personal dev machine ngrok URL or the 127.0.0.1:57609 entry.
_tool_name_map key format improved — Keys are now servername__toolname (full namespaced name), so single-server collisions are less likely.

🔴 Critical — Still Open from Previous Review

1. JWT issuer read from unverified token (packages/fastapi/app/auth.py:44-51)

unverified_claims = jwt.decode(token, options={"verify_signature": False})
issuer = unverified_claims.get("iss", "")
jwks = await _get_jwks(http_client, issuer)

The JWKS URL is still derived from the attacker-controlled iss claim. An attacker can host their own JWKS and sign a valid-looking token. Pin the expected issuer in Settings and reject mismatches before fetching. Fix this →

2. Audience verification disabled (packages/fastapi/app/auth.py:76)

options={"verify_aud": False}

Tokens issued for any Clerk application are accepted. Fix this →

3. postMessage wildcard origin (packages/fastapi/app/routes/mcp_oauth.py:249)

window.opener.postMessage({json.dumps(message_data)}, "*");

The OAuth result (including access token provenance) is broadcast to any origin. Pass settings.frontend_url as the second argument. Fix this →

🟠 High — Still Open from Previous Review

4. conversation_id not validated against authenticated user (packages/fastapi/app/routes/chat.py:719)

body.conversation_id is passed directly to save_assistant_message and patch_message_usage with no check that the conversation belongs to user_id. User A could supply User B's conversation ID to inject messages. Validate ownership in the Convex mutation or look up the conversation first.

5. OAuth PKCE state in-process only (packages/fastapi/app/services/mcp_oauth.py:40)

_pending_oauth is module-level. Multi-worker deployments (e.g. uvicorn --workers N) will fail OAuth callbacks that land on a different worker. Move to Redis or Convex.

🟠 High — New Findings

6. Redirect URI built from unvalidated X-Forwarded-* headers (packages/fastapi/app/routes/mcp_oauth.py:52-55, 175-177)

forwarded_proto = request.headers.get("x-forwarded-proto", request.url.scheme)
forwarded_host = request.headers.get("x-forwarded-host") or request.headers.get("host", "localhost:8000")
redirect_uri = f"{forwarded_proto}://{forwarded_host}/api/mcp/oauth/callback"

If the FastAPI server is directly internet-exposed (without a trusted reverse proxy stripping these headers), a client can inject an arbitrary x-forwarded-host to redirect OAuth callbacks to an attacker-controlled host. Use settings.fastapi_base_url (already in config) instead of trusting forwarded headers, or configure a trusted proxy list.

7. Skill content fetching allows arbitrary GitHub requests (packages/fastapi/app/routes/chat.py:142-238)

_fetch_skill_md_from_repo makes HTTP requests to raw.githubusercontent.com and the GitHub API for any source derived from the skill name in the user's harness config. If a user configures a harness with a skill from a repo they control, they can cause the backend to make requests to GitHub on their behalf. While allowed_skills gates which names are accepted, the path components of the skill name (source, skill_id) flow directly into URLs without sanitization. A crafted skill name like evil-org/evil-repo/../../etc won't traverse the filesystem but could target arbitrary GitHub paths. Consider validating that source matches owner/repo format before constructing URLs.

🟡 Medium — Still Open from Previous Review

8. JWKS cache has no TTL (packages/fastapi/app/auth.py:9-22)

_jwks_cache is a global with no expiry. Clerk key rotation only triggers a retry when a specific kid is missing; the cached set of keys can grow stale permanently.

9. Duplicate model IDs in AVAILABLE_MODELS (packages/fastapi/app/config.py:54-55)

{"id": "anthropic/claude-sonnet-4", "name": "Claude Sonnet 4"},
{"id": "anthropic/claude-sonnet-4", "name": "Claude Sonnet 4 (Thinking)"},

Two entries share the same id. Same for Claude Opus 4. Frontend model selectors that key by id will de-duplicate these. Use distinct IDs (e.g. anthropic/claude-sonnet-4:thinking). Fix this →

🟡 Medium — New Findings

10. GitHub OAuth PKCE code_verifier generated but never sent (packages/fastapi/app/services/mcp_oauth.py:802-803)

code_verifier, code_challenge = _generate_pkce_pair()

For the standalone GitHub OAuth flow, code_challenge is not included in the authorization URL params and code_verifier is not sent during token exchange (GitHub OAuth Apps don't support PKCE). The generated PKCE data is stored in _pending_oauth but silently ignored. This is dead code — remove it to avoid confusion about security posture.

11. _tools_cache keyed by server URL only (packages/fastapi/app/services/mcp_client.py:105)

The tools list cache is not user-scoped. For most servers this is fine (tool schemas are the same for all users), but for any server that returns per-user tool definitions, user A's cached tools will be served to user B. Low risk currently, worth a comment.

12. No message content size limit (packages/fastapi/app/routes/chat.py:523)

The full body.messages list is forwarded to OpenRouter on every request with no cap on total tokens or message count. Long conversations can generate unexpectedly large token costs or cause upstream timeouts.

🔵 Low / Nits

13. import re inside _extract_summary (packages/fastapi/app/routes/chat.py:64)

def _extract_summary(detail: str, max_chars: int = 300) -> str:
    import re

re is already imported at the top of the file (line 4). Remove the redundant in-function import.

14. import os inside resolve_princeton_netid (packages/fastapi/app/services/mcp_client.py:73)

Same pattern — os is used once inside a function. Hoist to module level.

15. import base64 inside call_tool (packages/fastapi/app/services/mcp_client.py:503)

Same pattern — move to top-level imports.

16. Bearer tokens stored in plaintext in Convex (packages/convex-backend/convex/schema.ts:17)

authToken on MCP server configs is a plain string. Flagged for future work: these should be encrypted at rest or stored in a secrets manager.

17. messages.userId optional (packages/convex-backend/convex/schema.ts:97)

userId: v.optional(v.string()),

userId should likely be required on messages to prevent orphaned records and simplify user-scoped queries.

18. Harness delete doesn't cascade to conversations (packages/convex-backend/convex/harnesses.ts)

Deleting a harness leaves conversations with a stale lastHarnessId. Acceptable for now but worth a comment or a compensating query to null out the references.

Summary

The codebase has grown considerably and remains well-structured. The session cache isolation fix (item 5 from the prior review) is a meaningful improvement. The two highest-priority unfixed issues remain JWT issuer pinning (#1) and postMessage wildcard (#3). The new redirect URI from unvalidated forwarded headers (#6) is worth fixing before deploying behind anything less than a fully trusted reverse proxy, and the skill content SSRF vector (#7) deserves a sanity-check on the source path before it goes to untrusted users.

Fix/staging cleanup

cole-ramer · 2026-04-06T01:27:42Z

@claude, review this, focus on notifying REAL issues

claude · 2026-04-06T01:27:57Z

Claude finished @cole-ramer's task in 1m 43s —— View job

PR Review

Tasks

Gather context
Read auth.py, chat.py, mcp_client.py, mcp_oauth.py, config.py
Review Convex backend
Post findings

What's Been Fixed Since Prior Reviews

JWT issuer pinned — settings.clerk_issuer is now used directly; unverified iss claim no longer controls JWKS URL ✅
Hardcoded ngrok/dev URLs removed from CORS config ✅
Session cache user-scoped — keyed by (url, user_id, netid) ✅
OAuth redirect URI — now uses settings.fastapi_base_url, not forwarded headers ✅

🔴 Critical — Still Open

1. Audience verification disabled (packages/fastapi/app/auth.py:76)

options={"verify_aud": False}

This accepts JWTs issued for any Clerk application, not just this one. A token from a completely different Clerk app (e.g. a user's personal project) will pass verification. Set clerk_jwt_audience in Settings and verify it.

Fix this →

2. postMessage wildcard origin (packages/fastapi/app/routes/mcp_oauth.py:244)

window.opener.postMessage({json.dumps(message_data)}, "*");

The OAuth result — including the server URL that was just authorized — is broadcast to every open window/tab, regardless of origin. Any malicious page the user has open can intercept it and learn which MCP servers they authenticated with. Replace "*" with settings.frontend_url.

Fix this →

🟠 High — Still Open

3. conversation_id not validated against the authenticated user (packages/fastapi/app/routes/chat.py:719)

body.conversation_id from the client is passed directly to save_assistant_message and patch_message_usage. The Convex saveAssistantMessage is an internalMutation that checks the conversation exists but does not verify convo.userId === user_id (messages.ts:204-205). User A can supply User B's conversation_id and the assistant's full response gets saved into User B's conversation history.

Fix: add an ownership check in the FastAPI route before the agentic loop starts, or add userId validation to the saveAssistantMessage internalMutation.

Fix this →

4. OAuth PKCE state stored in process memory only (packages/fastapi/app/services/mcp_oauth.py:40)

_pending_oauth: dict[str, "PendingOAuth"] = {}

Under multi-worker deployments (uvicorn --workers N), the OAuth callback will randomly land on a different process from the one that stored the state, causing "Invalid or expired OAuth state" errors for ~(N-1)/N of attempts. This is a functional bug under any non-single-process deployment. Move _pending_oauth to Redis or Convex.

🟡 Medium

5. Duplicate model IDs in AVAILABLE_MODELS (packages/fastapi/app/config.py:62–67)

{"id": "anthropic/claude-sonnet-4", "name": "Claude Sonnet 4"},
{"id": "anthropic/claude-sonnet-4", "name": "Claude Sonnet 4 (Thinking)"},
{"id": "anthropic/claude-opus-4",   "name": "Claude Opus 4"},
{"id": "anthropic/claude-opus-4",   "name": "Claude Opus 4 (Thinking)"},

Any frontend code that deduplicates by id (e.g. a <select> keyed by model ID) will silently drop one of the two variants. The correct fix is already present in MODEL_MAP — use "claude-sonnet-4-thinking" / "claude-opus-4-thinking" as the IDs here too.

Fix this →

🔵 Nits (3 in-function imports that should be hoisted)

import re inside _extract_summary — re is already imported at top of file (chat.py:4, chat.py:62)
import os inside resolve_princeton_netid — used once, should be module-level (mcp_client.py:73)
import base64 inside call_tool — should be module-level (mcp_client.py:503)

Summary

Items 1 (audience verification) and 2 (postMessage wildcard) are the two unfixed security issues from prior reviews. Item 3 (conversation ownership) is a real data isolation bug — the saveAssistantMessage internal mutation has no user check. Item 4 is a functional reliability bug under any multi-process deployment. The duplicate model IDs (item 5) will silently break any model selector that keys by ID.

cole-ramer · 2026-04-06T01:46:25Z

Investigating the issues flagged, they appear not to be real issues (sorry if I am wrong)

DIodide and others added 30 commits February 26, 2026 12:23

chk: knot-1

ca554f1

significantly better

7662e6c

to indigo

07652e2

pre commit hook change

ed34360

change to black

1702f5a

add shadcn-ui components from default registry

4567ed0

add logo

6ca6267

add sign-in page

d087562

implement TanStack Router authentication guards and remove deprecated…

b18d060

… Header component

format __root with biome

be1fe34

Add the convex backend path to the tsconfig to include it during lint…

de27980

…ing - should've been in the staging already

add convex-backend as a dep to the web project + add markdown renderi…

9b70d6e

…ng related deps to apps/web

add proper chat page

ebf1e40

biome format

3692db1

style change

fbedbdc

use best practice for convex id typing

64babeb

seed the floating dots to not break SSR

ce12102

add composite index by_user_last_message to support recency filtering

2b8fc0f

Change list harnesses query to use this recency by last message mutation

d95313e

remove double model_config definition

02c04d1

decouple harnesses from conversations, keep track of last harness use…

8401dc8

…d per conversation, add usersettings logic

Add settings + sign out feature + signout ui

8f32744

Merge pull request #3 from DIodide/feat/landing-page

4ef2340

Feat/landing page

Merge pull request #4 from DIodide/feat/login-and-onboarding

5c6b704

Feat/login and onboarding

initial chat-implementation

22c8bd9

biome format

f597e54

delete packages/backend

18675f7

test

1ba5874

add missing dep

c1ea761

add proper error handling and fastapi Depends or retrieving http clie…

1412864

…nt and user

richardw19 and others added 20 commits March 31, 2026 22:55

Add Kimi K2 model

beaca91

Add harness.nz and staging.harness.nz to CORS origins

4dc0e51

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

SHould be merged of staging and skills

b71ac2a

other smaller changes

00c6527

Merge pull request #18 from DIodide/feak/skills.sh

79a3c11

skills.sh implementation

added indication for when overriding default harness model and also f…

fefb9eb

…or switching back to the default.

Merge staging into feat/deployment

17b76ed

Merge pull request #22 from DIodide/feat/deployment

bb4a282

Deployment fixes: CORS, wrangler vars, CD worker targeting

added setting to lock model (or allow model switching per session) wh…

a3f8a44

…en creating/editing a Harness

Merge branch 'staging' into feat/model_switching

e6b7449

revert previous commit

5102f65

Merge branch 'feat/model_switching' of https://github.com/DIodide/Har…

706b18a

…ness into feat/model_switching

Add user setting to toggle between model switching on a per-session b…

3333ba0

…asis or the entire Harness itself.

fix guard on convex mutation and ident style

3683428

Merge pull request #21 from DIodide/feat/model_switching

3d03eda

Feat/model switching

Merge pull request #23 from DIodide/fix/netid-passing

e38ceae

Fix: netid not passing to Junction Engine

Merge pull request #24 from DIodide/fix/netid-passing

c49f60c

Fix netid: Clerk API fallback with TTL cache

cole-ramer and others added 4 commits April 5, 2026 17:39

clerk issuer fix

c5040bf

event listners and an oauth fix

9806532

code cleanup

10607fb

Merge pull request #26 from DIodide/fix/staging-cleanup

4a33c63

Fix/staging cleanup

cole-ramer merged commit 415156c into main Apr 6, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Full application build: landing page, auth, chat with MCP tool use, harness management, and OAuth#8

Full application build: landing page, auth, chat with MCP tool use, harness management, and OAuth#8
cole-ramer merged 162 commits intomainfrom
staging

DIodide commented Mar 11, 2026

Uh oh!

cole-ramer commented Apr 1, 2026

Uh oh!

claude bot commented Apr 1, 2026 •

edited

Loading

Uh oh!

cole-ramer commented Apr 6, 2026

Uh oh!

claude bot commented Apr 6, 2026 •

edited

Loading

Uh oh!

cole-ramer commented Apr 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

DIodide commented Mar 11, 2026

Summary

Changes

PRs included

Uh oh!

cole-ramer commented Apr 1, 2026

Uh oh!

claude bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Tasks

What's Been Fixed

🔴 Critical — Still Open from Previous Review

🟠 High — Still Open from Previous Review

🟠 High — New Findings

🟡 Medium — Still Open from Previous Review

🟡 Medium — New Findings

🔵 Low / Nits

Summary

Uh oh!

cole-ramer commented Apr 6, 2026

Uh oh!

claude bot commented Apr 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review

Tasks

What's Been Fixed Since Prior Reviews

🔴 Critical — Still Open

🟠 High — Still Open

🟡 Medium

🔵 Nits (3 in-function imports that should be hoisted)

Summary

Uh oh!

cole-ramer commented Apr 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

claude bot commented Apr 1, 2026 •

edited

Loading

claude bot commented Apr 6, 2026 •

edited

Loading