Backend Debugging Guide

Last Updated: 2025-11-27 Component: services/api-gateway/

Symptoms

500 Internal Server Error

Likely Causes:

Unhandled exception in request handler
Database connection timeout
Missing required environment variable
External service failure (OpenAI, Qdrant)

Steps to Investigate:

Check API Gateway logs (Docker container):

docker logs voiceassist-server --tail 100 2>&1 | grep -i error

Look for stack traces:

docker logs voiceassist-server --since "10m" 2>&1 | grep -A 20 "Traceback"

Check health endpoints:

curl http://localhost:8000/health
curl http://localhost:8000/ready

Verify environment variables:

# Check if critical vars are set in .env
grep -E "DATABASE_URL|REDIS_URL|OPENAI_API_KEY" /home/asimo/VoiceAssist/.env

Relevant Logs:

docker logs voiceassist-server
Structured JSON logs with trace_id

Relevant Code Paths:

services/api-gateway/app/main.py - Exception handlers
services/api-gateway/app/core/exceptions.py - Custom exceptions
services/api-gateway/app/api/*.py - Route handlers

401 Unauthorized

Likely Causes:

JWT token expired
Token missing from request
Token signed with wrong key
User revoked or deactivated

Steps to Investigate:

Decode the JWT (without verifying):

# Extract token from Authorization header
echo "YOUR_JWT_TOKEN" | cut -d'.' -f2 | base64 -d 2>/dev/null | jq .

Check token expiration:

# Look at 'exp' claim - Unix timestamp

Verify JWT secret matches:

# Compare JWT_SECRET_KEY in env with what was used to sign

Check if user is active:

SELECT id, email, is_active FROM users WHERE id = 'USER_ID';

Relevant Code Paths:

services/api-gateway/app/core/security.py - JWT verification
services/api-gateway/app/core/dependencies.py - Auth dependencies

503 Service Unavailable

Likely Causes:

Database connection pool exhausted
Redis not responding
Qdrant vector store down
External API rate limited

Steps to Investigate:

Check database connectivity:

# PostgreSQL
psql -h localhost -U voiceassist -d voiceassist -c "SELECT 1"

# Check connection count
psql -c "SELECT count(*) FROM pg_stat_activity WHERE datname = 'voiceassist'"

Check Redis:

redis-cli ping
redis-cli info clients

Check Qdrant:

curl http://localhost:6333/collections

Check API Gateway connection pool:

curl http://localhost:8000/metrics | grep "db_connection"

Relevant Code Paths:

services/api-gateway/app/core/database.py - DB connection
services/api-gateway/app/services/cache_service.py - Redis
services/api-gateway/app/services/vector_store_service.py - Qdrant

Database Issues

Connection Pool Exhaustion

Symptoms:

Requests hanging
Timeout errors
"too many connections" in logs

Investigation:

# Check active connections
psql -c "SELECT count(*), state FROM pg_stat_activity WHERE datname = 'voiceassist' GROUP BY state"

# Find long-running queries
psql -c "SELECT pid, now() - pg_stat_activity.query_start AS duration, query FROM pg_stat_activity WHERE datname = 'voiceassist' AND state != 'idle' ORDER BY duration DESC"

# Kill stuck query if needed
psql -c "SELECT pg_terminate_backend(PID)"

Fix:

Increase pool_size in database config
Add connection timeout
Check for leaked connections in code

Migration Issues

# Check current migration version
cd services/api-gateway
alembic current

# Check migration history
alembic history

# Run pending migrations
alembic upgrade head

# Rollback if needed
alembic downgrade -1

Cache Issues

Redis Not Responding

Symptoms:

Cache misses everywhere
Slower response times
Session lookup failures

Investigation:

# Check Redis status
sudo systemctl status redis-server
redis-cli ping

# Check memory
redis-cli info memory

# Check connected clients
redis-cli client list

# Check slow log
redis-cli slowlog get 10

Relevant Code Paths:

services/api-gateway/app/services/cache_service.py
services/api-gateway/app/core/config.py - REDIS_URL

OpenAI API Issues

Rate Limiting / Quota Exceeded

Symptoms:

429 errors from OpenAI
Empty AI responses
Timeout waiting for completion

Investigation:

Check recent OpenAI calls:

docker logs voiceassist-server --since "1h" 2>&1 | grep -i "openai\|rate\|429"

Check API key validity:

curl https://api.openai.com/v1/models \
  -H "Authorization: Bearer $OPENAI_API_KEY"

Check usage dashboard: https://platform.openai.com/usage

Relevant Code Paths:

services/api-gateway/app/services/llm_client.py
services/api-gateway/app/core/config.py - OPENAI_API_KEY

RAG Pipeline Issues

Poor Search Results

Symptoms:

Irrelevant document retrieval
Empty results for valid queries
Low confidence scores

Investigation:

Check vector store health:

curl http://localhost:6333/collections/medical_docs

Test embedding generation:

# In Python shell
from app.services.embedding_service import EmbeddingService
svc = EmbeddingService()
embedding = await svc.embed_text("test query")
print(len(embedding))  # Should be 1536 for OpenAI ada-002

Check document count:

curl http://localhost:6333/collections/medical_docs | jq '.result.points_count'

Relevant Code Paths:

services/api-gateway/app/services/rag_service.py
services/api-gateway/app/services/embedding_service.py
services/api-gateway/app/services/vector_store_service.py

Metrics to Monitor

Metric	Normal Range	Alert Threshold
`http_request_duration_seconds`	< 500ms	> 2s
`db_connection_pool_size`	5-20	> 80% used
`http_requests_total{status=5xx}`	0	> 10/min
`redis_connection_errors`	0	> 0

Backend Debugging Guide

Backend Debugging Guide

Symptoms

500 Internal Server Error

401 Unauthorized

503 Service Unavailable

Database Issues

Connection Pool Exhaustion

Migration Issues

Cache Issues

Redis Not Responding

OpenAI API Issues

Rate Limiting / Quota Exceeded

RAG Pipeline Issues

Poor Search Results

Metrics to Monitor

Related Documentation