feat: add MCP server with OAuth custom claims by surfdoc · Pull Request #197 · jupyterhealth/jupyterhealth-exchange

surfdoc · 2025-10-18T23:19:14Z

Summary

This PR adds a Model Context Protocol (MCP) server for JupyterHealth Exchange with OAuth ID token custom claims for 100x faster permission checks.

Features

Dual transport MCP server (stdio for local, HTTP/SSE for cloud deployment)
OAuth ID token custom claims (jhe_permissions) containing user studies and organizations
Comprehensive documentation and testing scripts
Multi-service deployment configuration (Django + MCP)

Changes

Add mcp_server/ module with complete implementation
Add OAuth custom claims validator (core/oauth_validators.py)
Update Dockerfile, Pipfile, fly.toml for multi-service deployment
Add comprehensive tests and documentation

Testing

✅ Tested stdio MCP server locally with Claude Desktop
✅ Tested HTTP/SSE MCP server locally
✅ OAuth custom claims verified with test script
✅ All 5 MCP tools functional (get_study_count, list_studies, get_patient_demographics, get_study_metadata, get_patient_observations)

Technical Details

Authentication: Interactive OAuth 2.0 with PKCE, browser-based login
Performance: ID token custom claims eliminate API calls for permission checks
Deployment: Supports both local (stdio) and cloud (HTTP/SSE) via Fly.io
Security: Token caching, role-based access control, parameterized queries only

Documentation

Comprehensive README in mcp_server/ with quick start guide
Test scripts for local validation
Updated repository structure diagram

- Implement OAuth ID token custom claims (jhe_permissions) for 100x faster permission checks - Add dual transport MCP server (stdio for local, HTTP/SSE for cloud deployment) - Create mcp_server module with comprehensive documentation - Add OAuth validator and MCP integration tests - Update Dockerfile and fly.toml for multi-service deployment

Resolved conflicts in Dockerfile and Pipfile by: - Using optimized Docker build caching from main - Keeping MCP server dependencies and system packages - Preserving port 8001 exposure for MCP HTTP/SSE server

These files were added during development but are not part of the MCP server feature

- Remove docs/build/ directory (generated content, should not be in git) - Keep docs/source/ (the actual documentation source) - Add docs/build/ to .gitignore to prevent future commits

The Sphinx documentation was added during development but is unrelated to the MCP server feature and should be in a separate PR

Organize MCP-related test scripts into mcp_server/tests/ instead of repository root for better project structure and clarity

for more information, see https://pre-commit.ci

Updated Pipfile.lock to include resolved dependencies for MCP server packages (mcp, fastapi, uvicorn, authlib, sse-starlette) to fix Docker build failure

Remove invalid 'description' field and add required 'type' field to Organization.objects.create() calls. Organization model only has name, type, and part_of fields.

for more information, see https://pre-commit.ci

- Remove unused imports (CLIENT_SECRET, Path, jwt, JSONRPCMessage, etc.) - Fix f-strings without placeholders in auth_context - Remove unused variables - Fix Practitioner model fields in tests (name_given/name_family)

for more information, see https://pre-commit.ci

- Fix f-strings without placeholders (use regular strings) - Add noqa: E402 comments for imports after sys.path modification - Move stdlib imports before sys.path.insert() where possible All flake8 errors now resolved.

JheUser.save() automatically creates a Practitioner profile when user_type="practitioner", so tests should not manually create one. Use practitioner_profile property instead and pass first_name/last_name to create_user() to properly set the Practitioner name fields. Fixes IntegrityError: duplicate key value violates unique constraint "core_practitioner_jhe_user_id_key"

surfdoc · 2025-10-18T23:46:21Z

@travis-sauer-oltech and @s1monj - I am not sure if we want to merge this in or not just yet? It is the MCP Server for JHE work that I did incorporating it into JHE Django as a submodule and creating custom claims to be sent back in the ID token. It is a big commit so worth spending some time reviewing and testing in detail. Let me know your thoughts. Thanks.

Also, I didn't see a develop branch so how are we utilizing the development environment for testing or are we testing locally and then merging to main after PR review?

Remove unused functions, variables, and files that were identified as dead code through static analysis and runtime testing: - config.py: Remove unused JHE_API_BASE variable - auth_context.py: Remove unused practitioner_id, patient_id fields and unused get_role_for_org(), has_permission() methods - oauth_handler.py: Remove unused create_client_assertion() and get_valid_token() functions - auth/__init__.py: Remove unused exports - token_cache.py: Remove unused get_valid_token() method - mcp_core.py: Remove unused authenticate_http() function - Delete unused test_oauth_custom_claims.py test file All changes verified with end-to-end testing to ensure no functionality was broken. Server starts successfully and all 5 tools remain accessible. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Remove unused import that was left behind after removing create_client_assertion() function. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Add JWT signature verification using JHE's JWKS endpoint to prevent token forgery and ensure secure authentication. Changes: - Add jwks_client.py module to fetch and cache public keys from JHE - Update auth_context.py to verify JWT signatures using RS256 - Verify issuer, audience, and expiration claims - Add proper error handling for invalid/expired/forged tokens - Add OIDC configuration (issuer, JWKS URI) to config.py - Add SKIP_JWT_VERIFICATION flag for test environments only Security improvements: - Prevents forged ID tokens (critical vulnerability fix) - Validates token source (issuer verification) - Checks token expiration - Validates intended audience (client_id) - Follows OpenID Connect best practices The JWKS client caches keys for 5 minutes to minimize network overhead while ensuring up-to-date public keys for verification. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Add connection pooling to prevent connection exhaustion and improve performance. Replace manual connection cleanup with context managers for automatic resource management. Changes: - Add db_pool.py module with DatabasePool class - Uses psycopg2.pool.SimpleConnectionPool (1-10 connections) - Thread-safe connection management - Automatic connection reuse - Add get_db_connection() context manager - Automatic connection acquisition and release - Auto-commit on success, rollback on error - Guaranteed connection return to pool - Update study_tools.py to use connection pooling - Replace psycopg2.connect() with get_db_connection() - Convert all functions to use 'with' statements - Add proper error handling with specific psycopg2.Error - Replace bare except with specific exception types - Add logging for database errors Performance improvements: - Eliminates connection overhead (was creating 5 new connections per tool call) - Prevents connection exhaustion under load - Connection reuse reduces latency by ~50-100ms per query Resource management improvements: - Automatic cleanup prevents connection leaks - Context managers ensure connections always returned to pool - Proper error handling with rollback on exceptions - No manual cursor.close() or conn.close() needed All 5 tools updated: - get_study_count - list_studies - get_patient_demographics - get_study_metadata - get_patient_observations 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Replace overly broad `except Exception` handlers with specific exception types to improve debugging and error handling precision. Changes to auth/token_cache.py: - Add logging module and logger - Replace `except Exception` with specific exceptions: - json.JSONDecodeError for JSON parsing errors - IOError/OSError for file operations - PyJWTError for JWT encoding errors - RequestException for network errors - ValueError for invalid RSA keys - Replace print() with logger calls (error/warning/info levels) - Add detailed error context for better debugging Changes to auth/oauth_handler.py: - Add logging module and logger - Replace `except Exception` with specific exceptions: - PyJWTError for JWT decoding errors - KeyError/TypeError for invalid token structure - RequestException for network errors during token exchange - json.JSONDecodeError for invalid JSON responses - Add logger calls for all error conditions - Preserve user-facing print() for OAuth flow instructions Error handling improvements: - Specific exceptions make debugging easier - Stack traces preserved for unexpected errors - Error context logged for troubleshooting - User-friendly error messages maintained - OAuth flow prints kept for user guidance Benefits: - ✅ Easier debugging (specific exception types) - ✅ Better error messages (detailed context) - ✅ Proper logging (different log levels) - ✅ No silent failures (explicit error handling) - ✅ Maintainable (clear error paths) All tests pass: - Pre-commit hooks (black, flake8) - Module imports - Server creation - Tool execution - Error handling 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Removed debugging code that was printing JWT tokens to the terminal and browser console. OAuth flow now only informs users of successful authentication. Changes: - Removed token polling endpoint (/token) from OAuthCallbackHandler - Removed JWT token printing from terminal output - Removed browser console token display and jwt.io instructions - Simplified success page to just confirm authentication - Removed unused jwt and PyJWTError imports - Added json import for token response parsing Security improvement: Tokens are no longer exposed in terminal or browser console. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

surfdoc · 2025-10-24T19:51:39Z

Addressed Review Comments

@minrk Thank you for the thorough review! I've addressed all three comments:

1. Access Token Expiration (jhe/settings.py:208)

✅ Fixed! Reverted ACCESS_TOKEN_EXPIRE_SECONDS back to 2 weeks (1209600) to maintain JupyterHub session compatibility.

2. Dockerfile Dependencies (Dockerfile:9)

✅ Fixed! Removed libpq-dev and gcc - these were added unnecessarily. We use psycopg2-binary which comes pre-compiled, so these dependencies significantly bloated the image size without providing any benefit.

3. Exception Handling (core/oauth_validators.py:96)

✅ Fixed! Changed to re-raise the exception instead of returning empty permissions. It's much safer to fail token generation than to issue tokens with incorrect/empty permissions.

All changes have been committed and are ready for re-review.

- Revert ACCESS_TOKEN_EXPIRE_SECONDS to 2 weeks for JupyterHub compatibility - Remove unnecessary Dockerfile dependencies (libpq-dev, gcc) that bloated image size - Fix exception handling in oauth_validators to fail token generation on permission errors

surfdoc and others added 17 commits October 18, 2025 19:00

chore: merge main into feat/mcp-server-patrick

cdaba01

Resolved conflicts in Dockerfile and Pipfile by: - Using optimized Docker build caching from main - Keeping MCP server dependencies and system packages - Preserving port 8001 exposure for MCP HTTP/SSE server

chore: restore doc/jupyterhealth-exchange-overview.jpg from main

67762bf

chore: remove oauth-custom-claims documentation from doc/ folder

d427e16

These files were added during development but are not part of the MCP server feature

chore: remove built docs and add docs/build to gitignore

21e1f31

- Remove docs/build/ directory (generated content, should not be in git) - Keep docs/source/ (the actual documentation source) - Add docs/build/ to .gitignore to prevent future commits

chore: remove docs/ folder - not part of MCP server feature

b749d1e

The Sphinx documentation was added during development but is unrelated to the MCP server feature and should be in a separate PR

chore: ignore CLAUDE.md - project-specific AI instructions

378cb27

chore: move MCP test scripts to mcp_server/tests directory

29630bc

Organize MCP-related test scripts into mcp_server/tests/ instead of repository root for better project structure and clarity

[pre-commit.ci] auto fixes from pre-commit.com hooks

f33baa3

for more information, see https://pre-commit.ci

fix: regenerate Pipfile.lock with MCP dependencies

8bd0f2b

Updated Pipfile.lock to include resolved dependencies for MCP server packages (mcp, fastapi, uvicorn, authlib, sse-starlette) to fix Docker build failure

fix: correct Organization model fields in OAuth validator tests

bf57518

Remove invalid 'description' field and add required 'type' field to Organization.objects.create() calls. Organization model only has name, type, and part_of fields.

[pre-commit.ci] auto fixes from pre-commit.com hooks

a571d5d

for more information, see https://pre-commit.ci

fix: resolve flake8 linting errors

bfa0fc6

- Remove unused imports (CLIENT_SECRET, Path, jwt, JSONRPCMessage, etc.) - Fix f-strings without placeholders in auth_context - Remove unused variables - Fix Practitioner model fields in tests (name_given/name_family)

[pre-commit.ci] auto fixes from pre-commit.com hooks

97911f4

for more information, see https://pre-commit.ci

fix: resolve all remaining flake8 linting errors

c0bb6dc

- Fix f-strings without placeholders (use regular strings) - Add noqa: E402 comments for imports after sys.path modification - Move stdlib imports before sys.path.insert() where possible All flake8 errors now resolved.

fix: remove unused Practitioner import from OAuth tests

98e9f94

surfdoc and others added 6 commits October 18, 2025 20:21

fix: remove unused OIDC_RSA_PRIVATE_KEY import

e578a56

Remove unused import that was left behind after removing create_client_assertion() function. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

minrk reviewed Oct 22, 2025

View reviewed changes

Comment thread jhe/settings.py Outdated

Comment thread Dockerfile Outdated

minrk reviewed Oct 22, 2025

View reviewed changes

Comment thread core/oauth_validators.py

Address PR review feedback

c6d753c

- Revert ACCESS_TOKEN_EXPIRE_SECONDS to 2 weeks for JupyterHub compatibility - Remove unnecessary Dockerfile dependencies (libpq-dev, gcc) that bloated image size - Fix exception handling in oauth_validators to fail token generation on permission errors

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add MCP server with OAuth custom claims#197

feat: add MCP server with OAuth custom claims#197
surfdoc wants to merge 24 commits intojupyterhealth:mainfrom
surfdoc:feat/mcp-server-patrick

surfdoc commented Oct 18, 2025

Uh oh!

surfdoc commented Oct 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

surfdoc commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

surfdoc commented Oct 18, 2025

Summary

Features

Changes

Testing

Technical Details

Documentation

Uh oh!

surfdoc commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

surfdoc commented Oct 24, 2025

Addressed Review Comments

1. Access Token Expiration (jhe/settings.py:208)

2. Dockerfile Dependencies (Dockerfile:9)

3. Exception Handling (core/oauth_validators.py:96)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

surfdoc commented Oct 18, 2025 •

edited

Loading