Backend Debugging Guide

Troubleshooting guide for the API Gateway, database, cache, and backend services.

Last Updated: 2025-11-27
Component: services/api-gateway/


Symptoms

500 Internal Server Error

Likely Causes:

  • Unhandled exception in request handler
  • Database connection timeout
  • Missing required environment variable
  • External service failure (OpenAI, Qdrant)

Steps to Investigate:

  1. Check API Gateway logs (Docker container):

     docker logs voiceassist-server --tail 100 2>&1 | grep -i error

  2. Look for stack traces:

     docker logs voiceassist-server --since "10m" 2>&1 | grep -A 20 "Traceback"

  3. Check health endpoints:

     curl http://localhost:8000/health
     curl http://localhost:8000/ready

  4. Verify environment variables:

     # Check if critical vars are set in .env
     grep -E "DATABASE_URL|REDIS_URL|OPENAI_API_KEY" /home/asimo/VoiceAssist/.env

Relevant Logs:

  • docker logs voiceassist-server
  • Structured JSON logs with trace_id

Relevant Code Paths:

  • services/api-gateway/app/main.py - Exception handlers
  • services/api-gateway/app/core/exceptions.py - Custom exceptions
  • services/api-gateway/app/api/*.py - Route handlers
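
For orientation, a catch-all handler of roughly this shape is what converts unhandled exceptions into structured 500 responses carrying a trace_id. This is a minimal sketch, not the actual code in app/main.py; the logger name, header name, and response shape are assumptions:

# Sketch of a catch-all exception handler (illustrative, not the real app/main.py code).
import logging
import uuid

from fastapi import FastAPI, Request
from fastapi.responses import JSONResponse

logger = logging.getLogger("api-gateway")  # assumed logger name
app = FastAPI()

@app.exception_handler(Exception)
async def unhandled_exception_handler(request: Request, exc: Exception):
    # Reuse an incoming trace id if present so the log line can be correlated with the response.
    trace_id = request.headers.get("x-trace-id", str(uuid.uuid4()))
    logger.error("unhandled exception", exc_info=exc,
                 extra={"trace_id": trace_id, "path": request.url.path})
    return JSONResponse(status_code=500,
                        content={"detail": "Internal Server Error", "trace_id": trace_id})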

401 Unauthorized

Likely Causes:

  • JWT token expired
  • Token missing from request
  • Token signed with wrong key
  • User revoked or deactivated

Steps to Investigate:

  1. Decode the JWT (without verifying):

     # Extract token from Authorization header
     echo "YOUR_JWT_TOKEN" | cut -d'.' -f2 | base64 -d 2>/dev/null | jq .

  2. Check token expiration:

     # Look at the 'exp' claim - a Unix timestamp

  3. Verify the JWT secret matches:

     # Compare JWT_SECRET_KEY in the environment with the key used to sign the token

  4. Check if the user is active:

     SELECT id, email, is_active FROM users WHERE id = 'USER_ID';
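
The same inspection can be done from Python with PyJWT. A sketch, assuming HS256 signing with JWT_SECRET_KEY (confirm the algorithm and claims in app/core/security.py):

# Sketch: inspect a JWT's claims and optionally verify the signature (PyJWT).
import datetime
import os

import jwt  # pip install PyJWT

token = "YOUR_JWT_TOKEN"

# 1. Decode without verification to read the claims.
claims = jwt.decode(token, options={"verify_signature": False})
exp = claims.get("exp")
if exp is not None:
    print("expires:", datetime.datetime.fromtimestamp(exp, tz=datetime.timezone.utc))
print("subject:", claims.get("sub"))

# 2. Verify against the configured secret (assumed HS256).
secret = os.environ["JWT_SECRET_KEY"]
try:
    jwt.decode(token, secret, algorithms=["HS256"])
    print("signature OK")
except jwt.ExpiredSignatureError:
    print("token expired")
except jwt.InvalidTokenError as err:
    print("invalid token:", err)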

Relevant Code Paths:

  • services/api-gateway/app/core/security.py - JWT verification
  • services/api-gateway/app/core/dependencies.py - Auth dependencies

503 Service Unavailable

Likely Causes:

  • Database connection pool exhausted
  • Redis not responding
  • Qdrant vector store down
  • External API rate limited

Steps to Investigate:

  1. Check database connectivity:

     # PostgreSQL
     psql -h localhost -U voiceassist -d voiceassist -c "SELECT 1"
     # Check connection count
     psql -c "SELECT count(*) FROM pg_stat_activity WHERE datname = 'voiceassist'"

  2. Check Redis:

     redis-cli ping
     redis-cli info clients

  3. Check Qdrant:

     curl http://localhost:6333/collections

  4. Check API Gateway connection pool:

     curl http://localhost:8000/metrics | grep "db_connection"
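
The checks above can be combined into a single probe. A sketch, assuming the psycopg2, redis, and requests packages and default local ports; the DSN fallbacks are placeholders, and psycopg2 needs a plain postgresql:// DSN (no +asyncpg driver suffix):

# Sketch: probe the gateway's backing services in one pass.
import os

import psycopg2   # pip install psycopg2-binary
import redis      # pip install redis
import requests

def check_postgres():
    dsn = os.environ.get("DATABASE_URL", "postgresql://voiceassist@localhost/voiceassist")
    with psycopg2.connect(dsn, connect_timeout=3) as conn:
        with conn.cursor() as cur:
            cur.execute("SELECT 1")
    return "ok"

def check_redis():
    client = redis.Redis.from_url(os.environ.get("REDIS_URL", "redis://localhost:6379/0"),
                                  socket_timeout=3)
    return "ok" if client.ping() else "no pong"

def check_qdrant():
    resp = requests.get("http://localhost:6333/collections", timeout=3)
    resp.raise_for_status()
    return "ok"

for name, check in [("postgres", check_postgres), ("redis", check_redis), ("qdrant", check_qdrant)]:
    try:
        print(f"{name}: {check()}")
    except Exception as err:  # a runbook probe - report everything rather than stopping
        print(f"{name}: FAILED ({err})")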

Relevant Code Paths:

  • services/api-gateway/app/core/database.py - DB connection
  • services/api-gateway/app/services/cache_service.py - Redis
  • services/api-gateway/app/services/vector_store_service.py - Qdrant

Database Issues

Connection Pool Exhaustion

Symptoms:

  • Requests hanging
  • Timeout errors
  • "too many connections" in logs

Investigation:

# Check active connections
psql -c "SELECT count(*), state FROM pg_stat_activity WHERE datname = 'voiceassist' GROUP BY state"

# Find long-running queries
psql -c "SELECT pid, now() - pg_stat_activity.query_start AS duration, query FROM pg_stat_activity WHERE datname = 'voiceassist' AND state != 'idle' ORDER BY duration DESC"

# Kill stuck query if needed
psql -c "SELECT pg_terminate_backend(PID)"

Fix:

  • Increase pool_size / max_overflow in the database config (see the sketch below)
  • Add a connection timeout
  • Check for leaked connections in code (sessions opened but never closed)
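
A sketch of the relevant pool knobs on the SQLAlchemy async engine; the values shown are illustrative, and the real configuration lives in app/core/database.py:

# Sketch: explicit pool sizing / timeouts for the SQLAlchemy async engine.
import os

from sqlalchemy.ext.asyncio import create_async_engine

DATABASE_URL = os.environ["DATABASE_URL"]  # e.g. postgresql+asyncpg://...

engine = create_async_engine(
    DATABASE_URL,
    pool_size=20,        # steady-state connections kept open
    max_overflow=10,     # extra connections allowed under burst load
    pool_timeout=30,     # seconds to wait for a free connection before raising
    pool_recycle=1800,   # recycle connections older than 30 minutes
    pool_pre_ping=True,  # detect dead connections before handing them out
)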

Migration Issues

# Check current migration version
cd services/api-gateway
alembic current

# Check migration history
alembic history

# Run pending migrations
alembic upgrade head

# Rollback if needed
alembic downgrade -1
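
If you want the same check programmatically (for example from a readiness probe), a sketch using Alembic's Python API; it assumes it is run from services/api-gateway and that DATABASE_URL can be rewritten to a sync driver:

# Sketch: report whether the database is at the latest Alembic revision.
import os

from alembic.config import Config
from alembic.runtime.migration import MigrationContext
from alembic.script import ScriptDirectory
from sqlalchemy import create_engine

cfg = Config("alembic.ini")  # run from services/api-gateway
head = ScriptDirectory.from_config(cfg).get_current_head()

# Assumes the DSN works with a sync driver; strip the +asyncpg suffix if present.
engine = create_engine(os.environ["DATABASE_URL"].replace("+asyncpg", ""))
with engine.connect() as conn:
    current = MigrationContext.configure(conn).get_current_revision()

print(f"current={current} head={head} up_to_date={current == head}")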

Cache Issues

Redis Not Responding

Symptoms:

  • Cache misses everywhere
  • Slower response times
  • Session lookup failures

Investigation:

# Check Redis status
sudo systemctl status redis-server
redis-cli ping

# Check memory
redis-cli info memory

# Check connected clients
redis-cli client list

# Check slow log
redis-cli slowlog get 10
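
If redis-cli responds but the application still reports cache errors, reproduce the client path from Python. A sketch, assuming the redis package and the same REDIS_URL the cache service uses; the debug key name is arbitrary:

# Sketch: exercise the same client path the gateway uses for the cache.
import os
import time

import redis  # pip install redis

client = redis.Redis.from_url(os.environ.get("REDIS_URL", "redis://localhost:6379/0"),
                              socket_timeout=2)

start = time.perf_counter()
client.set("debug:ping", "1", ex=60)   # write with a 60 s TTL
value = client.get("debug:ping")
elapsed_ms = (time.perf_counter() - start) * 1000

print(f"round trip: {elapsed_ms:.1f} ms, value={value!r}")
print("memory used:", client.info("memory")["used_memory_human"])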

Relevant Code Paths:

  • services/api-gateway/app/services/cache_service.py
  • services/api-gateway/app/core/config.py - REDIS_URL

OpenAI API Issues

Rate Limiting / Quota Exceeded

Symptoms:

  • 429 errors from OpenAI
  • Empty AI responses
  • Timeout waiting for completion

Investigation:

  1. Check recent OpenAI calls:

     docker logs voiceassist-server --since "1h" 2>&1 | grep -i "openai\|rate\|429"

  2. Check API key validity:

     curl https://api.openai.com/v1/models \
       -H "Authorization: Bearer $OPENAI_API_KEY"

  3. Check the usage dashboard: https://platform.openai.com/usage
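
When the cause is transient rate limiting, the standard mitigation is retry with exponential backoff around the completion call. A sketch with the openai Python client; the model name is a placeholder and the real retry logic (if any) lives in llm_client.py:

# Sketch: retry an OpenAI completion call with exponential backoff on 429s.
import time

from openai import OpenAI, RateLimitError  # pip install openai>=1.0

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def complete_with_backoff(messages, retries=5):
    delay = 1.0
    for attempt in range(retries):
        try:
            return client.chat.completions.create(model="gpt-4o-mini", messages=messages)
        except RateLimitError:
            if attempt == retries - 1:
                raise
            time.sleep(delay)   # back off before retrying
            delay *= 2          # 1 s, 2 s, 4 s, ...

resp = complete_with_backoff([{"role": "user", "content": "ping"}])
print(resp.choices[0].message.content)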

Relevant Code Paths:

  • services/api-gateway/app/services/llm_client.py
  • services/api-gateway/app/core/config.py - OPENAI_API_KEY

RAG Pipeline Issues

Poor Search Results

Symptoms:

  • Irrelevant document retrieval
  • Empty results for valid queries
  • Low confidence scores

Investigation:

  1. Check vector store health:

     curl http://localhost:6333/collections/medical_docs

  2. Test embedding generation:

     # In an async-capable Python shell (e.g. python -m asyncio or IPython), run from services/api-gateway
     from app.services.embedding_service import EmbeddingService
     svc = EmbeddingService()
     embedding = await svc.embed_text("test query")
     print(len(embedding))  # Should be 1536 for OpenAI ada-002

  3. Check document count:

     curl http://localhost:6333/collections/medical_docs | jq '.result.points_count'
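
To sanity-check retrieval outside the service, embed a query and search the collection directly. A sketch, assuming the openai and qdrant-client packages and that medical_docs was built with text-embedding-ada-002 (1536 dimensions); newer qdrant-client versions expose the same call as query_points:

# Sketch: embed a query and search Qdrant directly, bypassing the RAG service.
from openai import OpenAI                # pip install openai
from qdrant_client import QdrantClient   # pip install qdrant-client

openai_client = OpenAI()                 # reads OPENAI_API_KEY
qdrant = QdrantClient(url="http://localhost:6333")

query = "test query"
embedding = openai_client.embeddings.create(
    model="text-embedding-ada-002", input=query
).data[0].embedding                      # 1536-dim vector

hits = qdrant.search(collection_name="medical_docs", query_vector=embedding, limit=5)
for hit in hits:
    print(f"{hit.score:.3f}", hit.payload)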

Relevant Code Paths:

  • services/api-gateway/app/services/rag_service.py
  • services/api-gateway/app/services/embedding_service.py
  • services/api-gateway/app/services/vector_store_service.py

Metrics to Monitor

Metric                             Normal Range    Alert Threshold
http_request_duration_seconds      < 500ms         > 2s
db_connection_pool_size            5-20            > 80% used
http_requests_total{status=5xx}    0               > 10/min
redis_connection_errors            0               > 0
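
For a quick manual check against these thresholds, the raw series can be pulled straight from the gateway's /metrics endpoint. A sketch, assuming Prometheus text-format output on port 8000 and that the metric names match the table above:

# Sketch: scrape /metrics and print the series referenced in the table above.
import requests

WATCHED = ("http_request_duration_seconds", "db_connection_pool",
           "http_requests_total", "redis_connection_errors")

text = requests.get("http://localhost:8000/metrics", timeout=3).text
for line in text.splitlines():
    if line.startswith(WATCHED):
        print(line)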