HTTP API Reference

Nancy Brain provides a REST API for programmatic access to the knowledge base.

Quick Start

Start the HTTP API server:

python -m nancy_brain.connectors.http_api --port 8000

The API will be available at http://localhost:8000.

API Endpoints

Search Documents

POST /search

Search the knowledge base for relevant documents.

curl -X POST http://localhost:8000/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "machine learning algorithms",
    "limit": 5,
    "threshold": 0.7
  }'

Parameters:

- query (string, required): Search query
- limit (integer, optional): Maximum results to return (default: 6)
- threshold (number, optional): Minimum relevance score (default: 0.0)
- toolkit (string, optional): Filter by toolkit name
- doctype (string, optional): Filter by document type

Response:

{
  "results": [
    {
      "id": "repo/path/to/file.py",
      "text": "relevant content...",
      "score": 0.85,
      "github_url": "https://github.com/user/repo/blob/main/path/to/file.py"
    }
  ]
}
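
As a rough illustration of the request and response shapes above, the sketch below builds a /search body that leaves out unset optional filters, then ranks the hits in a response. The helper names build_search_payload and top_hits are hypothetical and not part of Nancy Brain; the defaults mirror the ones listed under Parameters.

```python
from typing import Any, Dict, List, Optional, Tuple


def build_search_payload(
    query: str,
    limit: int = 6,
    threshold: float = 0.0,
    toolkit: Optional[str] = None,
    doctype: Optional[str] = None,
) -> Dict[str, Any]:
    """Build the JSON body for POST /search, omitting unset optional filters."""
    if not query.strip():
        raise ValueError("query is required and must not be empty")
    payload: Dict[str, Any] = {"query": query, "limit": limit, "threshold": threshold}
    if toolkit is not None:
        payload["toolkit"] = toolkit
    if doctype is not None:
        payload["doctype"] = doctype
    return payload


def top_hits(response_body: Dict[str, Any], min_score: float) -> List[Tuple[str, float]]:
    """Return (id, score) pairs at or above min_score, highest score first."""
    hits = [
        (result["id"], result["score"])
        for result in response_body.get("results", [])
        if result["score"] >= min_score
    ]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

The payload can be passed as the json= argument of requests.post, and top_hits can be applied to response.json().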

Retrieve Document

GET /documents/{doc_id}

Retrieve a specific document passage.

curl "http://localhost:8000/documents/repo%2Fpath%2Fto%2Ffile.py?start=10&end=20"

Parameters:

- doc_id (string, required): Document identifier (URL-encoded)
- start (integer, optional): Starting line number (default: 0)
- end (integer, optional): Ending line number (default: full document)

Response:

{
  "doc_id": "repo/path/to/file.py",
  "text": "document content...",
  "start": 10,
  "end": 20,
  "github_url": "https://github.com/user/repo/blob/main/path/to/file.py"
}
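
Because document identifiers contain slashes, the doc_id has to be percent-encoded before it goes into the path. A minimal sketch of building the request URL with the standard library follows; document_url is a hypothetical helper, and BASE_URL assumes the Quick Start address.

```python
from typing import Optional
from urllib.parse import quote, urlencode

BASE_URL = "http://localhost:8000"  # assumed: the address from Quick Start


def document_url(doc_id: str, start: Optional[int] = None, end: Optional[int] = None) -> str:
    """Build the GET /documents/{doc_id} URL with a percent-encoded doc_id."""
    # safe="" makes quote() encode "/" as %2F so the id stays a single path segment.
    url = f"{BASE_URL}/documents/{quote(doc_id, safe='')}"
    params = {}
    if start is not None:
        params["start"] = start
    if end is not None:
        params["end"] = end
    if params:
        url += "?" + urlencode(params)
    return url


print(document_url("repo/path/to/file.py", start=10, end=20))
# http://localhost:8000/documents/repo%2Fpath%2Fto%2Ffile.py?start=10&end=20
```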

Batch Retrieve

POST /documents/batch

Retrieve multiple document passages in one request.

curl -X POST http://localhost:8000/documents/batch \
  -H "Content-Type: application/json" \
  -d '{
    "items": [
      {"doc_id": "repo/file1.py", "start": 0, "end": 10},
      {"doc_id": "repo/file2.py", "start": 5, "end": 15}
    ]
  }'
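
The same request body can be assembled programmatically. The sketch below is one way to do it; build_batch_body is a hypothetical helper, and its range check is a client-side assumption, since the server's own validation rules and the batch response format are not documented here.

```python
import json
from typing import Dict, Iterable, List, Tuple


def build_batch_body(passages: Iterable[Tuple[str, int, int]]) -> str:
    """Serialize (doc_id, start, end) tuples into a POST /documents/batch body."""
    items: List[Dict[str, object]] = []
    for doc_id, start, end in passages:
        # Assumed sanity check: reject negative or inverted line ranges early.
        if start < 0 or end < start:
            raise ValueError(f"invalid line range for {doc_id}: {start}-{end}")
        items.append({"doc_id": doc_id, "start": start, "end": end})
    return json.dumps({"items": items})


body = build_batch_body([("repo/file1.py", 0, 10), ("repo/file2.py", 5, 15)])
```

Unlike the single-document endpoint, the doc_id values travel in the JSON body here, so they are not URL-encoded.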

List Documents

GET /documents

Get a hierarchical list of all documents in the knowledge base.

curl "http://localhost:8000/documents?max_depth=3&path=microlensing_tools"

Parameters:

- max_depth (integer, optional): Maximum tree depth (default: 3)
- path (string, optional): Filter by path prefix

Health Check

GET /health

Check the health and status of the system.

curl http://localhost:8000/health

Response:

{
  "status": "ok",
  "index_version": "1.0.0",
  "documents_count": 1234,
  "last_updated": "2025-08-24T12:00:00Z"
}
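
A monitoring script might interpret this payload roughly as follows. The readiness rule (status "ok" plus a non-empty index) and the helper names is_healthy and index_age_seconds are assumptions made for illustration, not behavior defined by the API.

```python
from datetime import datetime, timezone
from typing import Any, Dict


def index_age_seconds(health: Dict[str, Any], now: datetime) -> float:
    """Seconds elapsed between the index's last_updated timestamp and now."""
    # fromisoformat() rejects a trailing "Z" before Python 3.11, so normalize it.
    last_updated = datetime.fromisoformat(health["last_updated"].replace("Z", "+00:00"))
    return (now - last_updated).total_seconds()


def is_healthy(health: Dict[str, Any]) -> bool:
    """Assumed rule: usable when status is "ok" and at least one document is indexed."""
    return health.get("status") == "ok" and health.get("documents_count", 0) > 0


sample = {
    "status": "ok",
    "index_version": "1.0.0",
    "documents_count": 1234,
    "last_updated": "2025-08-24T12:00:00Z",
}
now = datetime(2025, 8, 24, 13, 0, 0, tzinfo=timezone.utc)
print(is_healthy(sample), index_age_seconds(sample, now))  # True 3600.0
```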

Authentication

Currently, the HTTP API does not require authentication. In production deployments, you should add authentication and authorization layers.

Rate Limiting

The API includes basic rate limiting to prevent abuse. Default limits:

- 100 requests per minute per IP
- 1000 requests per hour per IP
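
Clients that may hit these limits can retry after a pause when they receive a 429 response. The sketch below shows one common approach, exponential backoff; call_with_backoff is a hypothetical helper, the delay values are illustrative, and send stands in for whatever function performs the HTTP call and returns a (status, body) pair.

```python
import time
from typing import Callable, Tuple


def call_with_backoff(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call send() until it returns a status other than 429 or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt < max_attempts - 1:
            # Exponential backoff: base_delay, then 2x, 4x, ...
            sleep(base_delay * (2 ** attempt))
    # Still rate limited after the final attempt; hand the 429 back to the caller.
    return status, body
```

Passing sleep as a parameter keeps the function easy to test without real waiting.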

Error Handling

The API returns standard HTTP status codes:

200 - Success
400 - Bad Request (invalid parameters)
404 - Not Found (document doesn't exist)
429 - Too Many Requests (rate limited)
500 - Internal Server Error

Error responses include details:

{
  "error": "Document not found",
  "code": "DOCUMENT_NOT_FOUND",
  "details": "No document with id 'invalid/path.py' exists in the knowledge base"
}
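
On the client side, the status code and error body can be turned into an exception so that callers do not have to inspect every response by hand. This is a minimal sketch: NancyBrainAPIError and parse_response are hypothetical names, and the fallback values for missing fields are assumptions.

```python
import json
from typing import Any


class NancyBrainAPIError(Exception):
    """Raised for non-2xx responses; carries the documented error fields."""

    def __init__(self, status: int, error: str, code: str, details: str) -> None:
        super().__init__(f"{status} {code}: {error}")
        self.status = status
        self.error = error
        self.code = code
        self.details = details


def parse_response(status: int, body: str) -> Any:
    """Return the decoded JSON body on success, else raise NancyBrainAPIError."""
    if 200 <= status < 300:
        return json.loads(body)
    try:
        data = json.loads(body)
    except ValueError:
        data = None
    if not isinstance(data, dict):
        # A proxy or crash may produce a non-JSON body; keep the raw text.
        data = {"details": body}
    raise NancyBrainAPIError(
        status,
        data.get("error", "Unknown error"),
        data.get("code", "UNKNOWN"),
        data.get("details", ""),
    )
```

With requests, this could be called as parse_response(response.status_code, response.text).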

Client Examples

Python

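from urllib.parse import quote

import requests

BASE_URL = "http://localhost:8000"

# Search for documents
response = requests.post(f"{BASE_URL}/search", json={
    "query": "neural networks",
    "limit": 5
})
response.raise_for_status()  # Fail early on 4xx/5xx responses
results = response.json()["results"]

# Retrieve a specific document; the ID contains "/" and must be URL-encoded
doc_id = quote(results[0]["id"], safe="")
response = requests.get(f"{BASE_URL}/documents/{doc_id}")
response.raise_for_status()
document = response.json()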

JavaScript

// Search for documents
const searchResponse = await fetch('http://localhost:8000/search', {
  method: 'POST',
  headers: {'Content-Type': 'application/json'},
  body: JSON.stringify({
    query: 'machine learning',
    limit: 5
  })
});
const { results } = await searchResponse.json();

// Retrieve a document (the ID contains "/" and must be URL-encoded)
const docId = encodeURIComponent(results[0].id);
const docResponse = await fetch(`http://localhost:8000/documents/${docId}`);
// Named `doc` to avoid clashing with the browser's global `document`
const doc = await docResponse.json();

Configuration

The HTTP API server can be configured via environment variables:

NANCY_BRAIN_HOST - Server host (default: "localhost")
NANCY_BRAIN_PORT - Server port (default: 8000)
NANCY_BRAIN_EMBEDDINGS_PATH - Path to embeddings directory
NANCY_BRAIN_CONFIG_PATH - Path to repositories configuration
NANCY_BRAIN_WEIGHTS_PATH - Path to weights configuration
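
A launcher or wrapper script could read the same variables as sketched below. This is not the server's actual loading code: ServerConfig and load_config are hypothetical names, and treating the three path variables as unset (None) when absent is an assumption, since their defaults are not documented.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class ServerConfig:
    host: str
    port: int
    embeddings_path: Optional[str]
    config_path: Optional[str]
    weights_path: Optional[str]


def load_config(env: Mapping[str, str] = os.environ) -> ServerConfig:
    """Read the NANCY_BRAIN_* variables, applying the documented defaults."""
    return ServerConfig(
        host=env.get("NANCY_BRAIN_HOST", "localhost"),
        port=int(env.get("NANCY_BRAIN_PORT", "8000")),
        # Assumed: no default is documented for the path variables.
        embeddings_path=env.get("NANCY_BRAIN_EMBEDDINGS_PATH"),
        config_path=env.get("NANCY_BRAIN_CONFIG_PATH"),
        weights_path=env.get("NANCY_BRAIN_WEIGHTS_PATH"),
    )
```

Accepting the environment as a mapping lets the function be exercised with a plain dictionary instead of the real process environment.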

Deployment

For production deployment, consider:

Reverse Proxy - Use nginx or similar
HTTPS - Enable SSL/TLS encryption
Authentication - Add API key or OAuth
Monitoring - Log requests and performance
Scaling - Use multiple instances behind a load balancer