Skip to content

Dataflows Hub¶

This hub documents the end-to-end data flows within the Home Security Intelligence system. Each document traces data through specific pathways, including timing information, error handling, and recovery mechanisms.

Dataflows Overview

End-to-End Flow Summary¶

%%{init: {'theme': 'dark'}}%%
flowchart TB
    Camera["Camera Upload (FTP)"]
    FW["File Watcher<br/>(inotify/poll)<br/>Debounce: 0.5s, Stability: 2s"]
    DQ["Detection Queue<br/>(Redis)<br/>Max size: configurable"]
    DW["Detection Worker"]
    YOLO["YOLO26 API<br/>(Circuit Breaker)<br/>Timeout: 60s, Retries: 3"]
    BA["Batch Aggregator<br/>(90s window)<br/>Idle timeout: 30s"]
    AQ["Analysis Queue<br/>(Redis)"]
    AW["Analysis Worker"]
    EP["Enrichment Pipeline (opt.)<br/>Florence-2, CLIP, Depth, Pose"]
    NEM["Nemotron Analyzer<br/>(LLM API)<br/>Timeout: 120s, Retries: 3"]
    DB[("Event Creation<br/>(PostgreSQL)")]
    EB["Event Broadcaster<br/>(Redis Pub/Sub)<br/>Message buffer: 100"]
    WS["WebSocket Clients"]

    Camera --> FW
    FW --> DQ
    DQ --> DW
    DW --> YOLO
    YOLO --> BA
    BA --> AQ
    AQ --> AW
    AW --> EP
    EP --> NEM
    NEM --> DB
    DB --> EB
    EB --> WS

Quick Reference¶

Document	Description
image-to-event.md	Complete detection pipeline from camera image to security event
event-lifecycle.md	Event states from creation to archival
websocket-message-flow.md	Real-time WebSocket event broadcasting
api-request-flow.md	REST API request processing
batch-aggregation-flow.md	Detection batching with timing diagram
llm-analysis-flow.md	Nemotron LLM analysis request/response
enrichment-pipeline.md	Florence-2, CLIP, depth, pose enrichment
error-recovery-flow.md	Circuit breaker and retry sequences
startup-shutdown-flow.md	Application lifecycle sequences

Key Timing Parameters¶

Parameter	Default	Source
File debounce delay	0.5s	`backend/services/file_watcher.py:355`
File stability time	2.0s	`backend/services/file_watcher.py:362`
Batch window	90s	`backend/services/batch_aggregator.py:145`
Batch idle timeout	30s	`backend/services/batch_aggregator.py:146`
YOLO26 connect timeout	10s	`backend/services/detector_client.py:97`
YOLO26 read timeout	60s	`backend/services/detector_client.py:98`
Nemotron connect timeout	10s	`backend/services/nemotron_analyzer.py:130`
Nemotron read timeout	120s	`backend/services/nemotron_analyzer.py:131`
WebSocket idle timeout	300s	Configurable in settings
WebSocket heartbeat interval	30s	Configurable in settings

Key Circuit Breaker Parameters¶

Service	Failure Threshold	Recovery Timeout	Source
YOLO26	5	60s	`backend/services/detector_client.py:300-310`
Nemotron	5	30s	`backend/main.py:265-270`
PostgreSQL	10	60s	`backend/main.py:273-278`
Redis	10	60s	`backend/main.py:273-278`

Concurrency Control¶

The system uses a shared semaphore to prevent GPU/AI service overload:

# backend/services/nemotron_analyzer.py:19-22
# Uses a shared asyncio.Semaphore to limit concurrent AI inference operations.
# This prevents GPU/AI service overload under high traffic. The limit is
# configurable via AI_MAX_CONCURRENT_INFERENCES setting (default: 4).

Error Categories¶

The enrichment pipeline classifies errors for observability:

Category	Description	Retry?
`SERVICE_UNAVAILABLE`	Connection errors, service down	Yes
`TIMEOUT`	Request timed out	Yes
`RATE_LIMITED`	HTTP 429, back off	Yes
`SERVER_ERROR`	HTTP 5xx, transient issue	Yes
`CLIENT_ERROR`	HTTP 4xx, bad request	No
`PARSE_ERROR`	JSON/response parsing failed	No
`VALIDATION_ERROR`	Invalid input data	No
`UNEXPECTED`	Unknown error type	Yes

Source: backend/services/enrichment_pipeline.py:170-189

AI Pipeline Architecture - Detailed AI processing documentation
Real-time Architecture - WebSocket and event system details
Resilience Patterns - Circuit breakers and fault tolerance