Database Operations

This document covers the database operations used by the NI Compute Subnet validator system to persist miner statistics, challenge results, allocation records, and proof-of-GPU data. The system uses SQLite for local data persistence with structured schemas for tracking network state and performance metrics.

For information about the scoring algorithms that use this data, see Scoring System. For details about proof-of-GPU validation that writes to these tables, see Proof of GPU.

The compute subnet uses a centralized SQLite database managed by the ComputeDb class to store all validator-related data. The database serves as the primary persistence layer for validator operations, storing everything from miner registration details to performance benchmarks.
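A minimal sketch of what such a connection manager could look like; the class shape and table list here are assumptions for illustration, and the real implementation lives in compute/utils/db.py:

```python
import sqlite3


class ComputeDb:
    """Sketch of a SQLite connection manager for validator data.

    Hypothetical structure; the actual ComputeDb class is defined
    in compute/utils/db.py.
    """

    def __init__(self, path: str = "database.db"):
        # check_same_thread=False lets multiple validator threads
        # share one connection; each caller works through its own cursor.
        self.conn = sqlite3.connect(path, check_same_thread=False)
        self.init_tables()

    def get_cursor(self) -> sqlite3.Cursor:
        return self.conn.cursor()

    def init_tables(self) -> None:
        # One representative table; the real class creates all eight.
        cur = self.conn.cursor()
        cur.execute(
            "CREATE TABLE IF NOT EXISTS miner ("
            "uid INTEGER PRIMARY KEY, ss58_address TEXT UNIQUE)"
        )
        self.conn.commit()

    def close(self) -> None:
        self.conn.close()
```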

graph TB
    subgraph "Database Layer"
        ComputeDb["ComputeDb<br/>SQLite Connection Manager"]
        SQLiteDB[("SQLite Database<br/>database.db")]
    end
    
    subgraph "Core Tables"
        MinerTable["miner<br/>(uid, ss58_address)"]
        MinerDetailsTable["miner_details<br/>(hotkey, details, no_specs_count)"]
        ChallengeTable["challenge_details<br/>(uid, success, elapsed_time, difficulty)"]
        AllocationTable["allocation<br/>(hotkey, details)"]
        PogStatsTable["pog_stats<br/>(hotkey, gpu_name, num_gpus)"]
        StatsTable["stats<br/>(uid, hotkey, gpu_specs, score)"]
        BlacklistTable["blacklist<br/>(hotkey, details)"]
        WandbTable["wandb_runs<br/>(hotkey, run_id)"]
    end
    
    subgraph "Accessing Components"
        ValidatorProcess["Validator Process<br/>neurons/validator.py"]
        ChallengeOps["Challenge Operations<br/>database/challenge.py"]
        AllocateOps["Allocation Operations<br/>database/allocate.py"]
        PogOps["PoG Operations<br/>database/pog.py"]
        MinerOps["Miner Operations<br/>database/miner.py"]
    end
    
    ComputeDb --> SQLiteDB
    
    SQLiteDB --> MinerTable
    SQLiteDB --> MinerDetailsTable
    SQLiteDB --> ChallengeTable
    SQLiteDB --> AllocationTable
    SQLiteDB --> PogStatsTable
    SQLiteDB --> StatsTable
    SQLiteDB --> BlacklistTable
    SQLiteDB --> WandbTable
    
    ValidatorProcess --> ComputeDb
    ChallengeOps --> ComputeDb
    AllocateOps --> ComputeDb
    PogOps --> ComputeDb
    MinerOps --> ComputeDb
    
    MinerTable -.->|"Foreign Key"| ChallengeTable
    MinerDetailsTable -.->|"Foreign Key"| PogStatsTable
    MinerDetailsTable -.->|"Foreign Key"| StatsTable

Sources: compute/utils/db.py:9-84, neurons/validator.py:170-172

The database schema consists of eight primary tables, each serving specific validator functions:

| Table Name | Primary Key | Purpose | Key Relationships |
|---|---|---|---|
| miner | uid | Basic miner registration | Referenced by challenge_details |
| miner_details | hotkey | Hardware specifications and Docker status | Referenced by pog_stats, stats |
| challenge_details | Auto-increment | Proof-of-Work challenge results | Foreign keys to miner table |
| allocation | hotkey | Active resource allocations | Unique hotkey constraint |
| pog_stats | Auto-increment | Proof-of-GPU benchmark results | Foreign key to miner_details |
| stats | uid | Comprehensive miner scoring data | Foreign key to miner_details |
| blacklist | Auto-increment | Penalized miner hotkeys | Unique hotkey constraint |
| wandb_runs | hotkey | WandB run tracking | Links to external monitoring |

The miner table stores basic network registration data, while miner_details contains comprehensive hardware specifications:

erDiagram
    miner {
        INTEGER uid PK
        TEXT ss58_address UK
    }
    
    miner_details {
        INTEGER id PK
        TEXT hotkey UK
        TEXT details
        INTEGER no_specs_count
    }
    
    challenge_details {
        INTEGER uid FK
        TEXT ss58_address FK
        BOOLEAN success
        REAL elapsed_time
        INTEGER difficulty
        TIMESTAMP created_at
    }
    
    miner ||--o{ challenge_details : "has challenges"
    miner_details ||--o{ pog_stats : "has PoG results"
    miner_details ||--o{ stats : "has statistics"

Sources: compute/utils/db.py:29-30, compute/utils/db.py:30-31, compute/utils/db.py:33-45
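The three tables in the diagram can be sketched as DDL; the column names follow the diagram, while the exact constraints and defaults are assumptions rather than the actual schema in compute/utils/db.py:

```python
import sqlite3

# Hypothetical DDL mirroring the ER diagram; constraints are assumed.
SCHEMA = """
CREATE TABLE IF NOT EXISTS miner (
    uid INTEGER PRIMARY KEY,
    ss58_address TEXT UNIQUE
);
CREATE TABLE IF NOT EXISTS miner_details (
    id INTEGER PRIMARY KEY,
    hotkey TEXT UNIQUE,
    details TEXT,
    no_specs_count INTEGER DEFAULT 0
);
CREATE TABLE IF NOT EXISTS challenge_details (
    uid INTEGER REFERENCES miner (uid),
    ss58_address TEXT REFERENCES miner (ss58_address),
    success BOOLEAN,
    elapsed_time REAL,
    difficulty INTEGER,
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP
);
"""


def init_schema(conn: sqlite3.Connection) -> None:
    # executescript runs all statements in one pass.
    conn.executescript(SCHEMA)
    conn.commit()
```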

The system maintains detailed performance metrics and resource allocation state through specialized tables:

graph LR
    subgraph "Performance Tracking"
        PogStats["pog_stats<br/>GPU benchmarking results"]
        ChallengeDetails["challenge_details<br/>PoW challenge outcomes"]
        Stats["stats<br/>Aggregated scoring data"]
    end
    
    subgraph "Resource Management"
        Allocation["allocation<br/>Active resource assignments"]
        Blacklist["blacklist<br/>Penalized miners"]
        WandbRuns["wandb_runs<br/>External monitoring links"]
    end
    
    subgraph "Data Sources"
        ValidatorProcess["Validator Process"]
        PogValidation["PoG Validation"]
        AllocationAPI["Allocation API"]
    end
    
    ValidatorProcess --> Stats
    PogValidation --> PogStats
    ValidatorProcess --> ChallengeDetails
    AllocationAPI --> Allocation
    ValidatorProcess --> Blacklist
    ValidatorProcess --> WandbRuns

Sources: compute/utils/db.py:53-62, compute/utils/db.py:64-77, compute/utils/db.py:46-47

The challenge system tracks proof-of-work validation results with comprehensive statistical analysis:

sequenceDiagram
    participant V as "Validator Process"
    participant CD as "ChallengeOps<br/>challenge.py"
    participant DB as "ComputeDb"
    participant CT as "challenge_details table"
    
    V->>CD: update_challenge_details(pow_benchmarks)
    CD->>DB: get_cursor()
    CD->>CT: INSERT challenge results
    Note over CD,CT: Bulk insert with executemany
    CD->>DB: commit()
    
    V->>CD: select_challenge_stats()
    CD->>CT: Complex query with CTEs
    Note over CD,CT: Analyzes last 60 attempts<br/>calculates success rates
    CD->>CD: Process statistics
    CD-->>V: Return aggregated stats dict

The select_challenge_stats function uses Common Table Expressions (CTEs) to analyze challenge performance over rolling windows, calculating success rates and average difficulties for the most recent 20 and 60 attempts.
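The rolling-window idea can be sketched with a window-function CTE that ranks each miner's attempts by recency and aggregates the newest N; this is an illustrative query, not the actual CTE structure used in challenge.py:

```python
import sqlite3

# Hypothetical sketch: rank challenge attempts per miner by recency,
# then compute success rate and average difficulty over the newest N.
LAST_N_STATS = """
WITH ranked AS (
    SELECT uid,
           success,
           difficulty,
           ROW_NUMBER() OVER (
               PARTITION BY uid ORDER BY created_at DESC
           ) AS rn
    FROM challenge_details
)
SELECT uid,
       AVG(success)    AS success_rate,
       AVG(difficulty) AS avg_difficulty
FROM ranked
WHERE rn <= ?
GROUP BY uid
"""


def challenge_stats(conn: sqlite3.Connection, window: int = 60) -> dict:
    # Read-only query: no transaction management needed.
    return {
        uid: (rate, diff)
        for uid, rate, diff in conn.execute(LAST_N_STATS, (window,))
    }
```

Passing window=20 or window=60 reproduces the two rolling windows described above.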

Sources: neurons/Validator/database/challenge.py:24-125, neurons/Validator/database/challenge.py:128-176

Allocation operations manage hardware specifications and resource assignment state:

graph TD
    subgraph "Miner Specification Flow"
        GetSpecs["get_miner_details()<br/>Retrieve all miner specs"]
        UpdateSpecs["update_miner_details()<br/>Batch update from WandB"]
        CheckDocker["select_has_docker_miners_hotkey()<br/>Filter Docker-capable miners"]
    end
    
    subgraph "Allocation Management"
        AllocateCheck["select_allocate_miners_hotkey()<br/>Find miners meeting requirements"]
        UpdateAllocation["update_allocation_db()<br/>Track active allocations"]
        UpdateBlacklist["update_blacklist_db()<br/>Manage penalized miners"]
    end
    
    subgraph "Database Tables"
        MinerDetailsTable[("miner_details")]
        AllocationTable[("allocation")]
        BlacklistTable[("blacklist")]
    end
    
    GetSpecs --> MinerDetailsTable
    UpdateSpecs --> MinerDetailsTable
    CheckDocker --> MinerDetailsTable
    
    AllocateCheck --> MinerDetailsTable
    UpdateAllocation --> AllocationTable
    UpdateBlacklist --> BlacklistTable
    
    MinerDetailsTable -.->|"JSON details parsing"| AllocateCheck

The update_miner_details function includes automatic schema migration logic to handle database structure changes while preserving existing data.
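The migration idea can be sketched as a column check against PRAGMA table_info followed by an additive ALTER TABLE, which preserves existing rows; the no_specs_count column here stands in for whatever column a real migration would add:

```python
import sqlite3


def migrate_miner_details(conn: sqlite3.Connection) -> None:
    """Hypothetical migration sketch: add a missing column in place.

    ALTER TABLE ... ADD COLUMN keeps all existing rows, so data
    is preserved across schema changes.
    """
    cur = conn.cursor()
    # PRAGMA table_info returns one row per column; index 1 is the name.
    cols = {row[1] for row in cur.execute("PRAGMA table_info(miner_details)")}
    if "no_specs_count" not in cols:
        cur.execute(
            "ALTER TABLE miner_details "
            "ADD COLUMN no_specs_count INTEGER DEFAULT 0"
        )
        conn.commit()
```

The check makes the migration idempotent: re-running it against an already-migrated table is a no-op.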

Sources: neurons/Validator/database/allocate.py:26-45, neurons/Validator/database/allocate.py:93-176, neurons/Validator/database/allocate.py:178-206

PoG operations maintain GPU benchmarking results and performance metrics:

flowchart LR
    subgraph "PoG Database Operations"
        UpdatePogStats["update_pog_stats()<br/>Store GPU benchmark results"]
        GetPogSpecs["get_pog_specs()<br/>Retrieve GPU specifications"]
        RetrieveStats["retrieve_stats()<br/>Load scoring statistics"]
        WriteStats["write_stats()<br/>Update comprehensive stats"]
    end
    
    subgraph "Data Tables"
        PogStatsTable[("pog_stats<br/>hotkey, gpu_name, num_gpus")]
        StatsTable[("stats<br/>uid, score, gpu_specs")]
    end
    
    subgraph "Processing Flow"
        PogValidation["PoG Validation Process"]
        ScoreCalculation["Score Calculation"]
        NetworkWeights["Network Weight Setting"]
    end
    
    PogValidation --> UpdatePogStats
    UpdatePogStats --> PogStatsTable
    GetPogSpecs --> PogStatsTable
    
    RetrieveStats --> StatsTable
    WriteStats --> StatsTable
    
    GetPogSpecs --> ScoreCalculation
    RetrieveStats --> ScoreCalculation
    ScoreCalculation --> WriteStats
    WriteStats --> NetworkWeights

Sources: neurons/Validator/database/pog.py (referenced), neurons/validator.py:361-362, neurons/validator.py:402-403
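Hedged sketches of the two PoG accessors named above, assuming pog_stats keeps one latest row per hotkey; the real signatures and logic live in neurons/Validator/database/pog.py:

```python
import sqlite3


def update_pog_stats(conn: sqlite3.Connection, hotkey: str,
                     gpu_name: str, num_gpus: int) -> None:
    """Hypothetical sketch: store the latest benchmark for a hotkey."""
    cur = conn.cursor()
    # Replace any previous benchmark so the table holds only the
    # most recent result per hotkey.
    cur.execute("DELETE FROM pog_stats WHERE hotkey = ?", (hotkey,))
    cur.execute(
        "INSERT INTO pog_stats (hotkey, gpu_name, num_gpus) VALUES (?, ?, ?)",
        (hotkey, gpu_name, num_gpus),
    )
    conn.commit()


def get_pog_specs(conn: sqlite3.Connection, hotkey: str):
    """Hypothetical sketch: fetch stored GPU specs, or None."""
    row = conn.execute(
        "SELECT gpu_name, num_gpus FROM pog_stats WHERE hotkey = ?",
        (hotkey,),
    ).fetchone()
    return {"gpu_name": row[0], "num_gpus": row[1]} if row else None
```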

The validator process integrates with the database through multiple synchronized operations:

sequenceDiagram
    participant VP as "Validator Process<br/>validator.py"
    participant DB as "ComputeDb"
    participant WB as "WandB Integration"
    participant NS as "Network State"
    
    Note over VP: Initialization Phase
    VP->>DB: ComputeDb()
    VP->>DB: select_miners()
    DB-->>VP: miners dict
    
    Note over VP: Scoring Synchronization
    VP->>DB: retrieve_stats()
    VP->>WB: get_allocated_hotkeys()
    VP->>WB: get_stats_allocated()
    VP->>VP: sync_scores()
    VP->>DB: write_stats()
    
    Note over VP: Allocation Monitoring
    VP->>DB: SELECT FROM allocation
    VP->>WB: update_allocated_hotkeys()
    
    Note over VP: PoG Results Processing
    VP->>VP: proof_of_gpu()
    VP->>DB: update_pog_stats()
    VP->>VP: sync_scores()
    VP->>NS: Set network weights

The validator maintains a continuous cycle of data synchronization between local database state, distributed WandB state, and blockchain network state.

Sources: neurons/validator.py:170-172, neurons/validator.py:312-404, neurons/validator.py:663-787

All database operations use transaction-safe patterns with proper error handling:

| Operation Type | Transaction Pattern | Error Handling |
|---|---|---|
| Single Inserts | cursor.execute() + commit() | rollback() on exception |
| Bulk Operations | cursor.executemany() + commit() | rollback() + logging |
| Complex Queries | Read-only, no transaction | Exception logging only |
| Schema Changes | DDL statements + commit() | rollback() + data preservation |

The database connection uses check_same_thread=False to support multi-threaded validator operations while maintaining thread safety through proper cursor management.
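The bulk-operation pattern from the table above can be sketched as follows; the allocation table and function name are illustrative stand-ins, not the project's actual helpers:

```python
import sqlite3


def safe_bulk_insert(conn: sqlite3.Connection, rows: list) -> bool:
    """Sketch of the bulk-write pattern: executemany + commit,
    rollback on any failure so the table is left untouched."""
    cur = conn.cursor()
    try:
        cur.executemany(
            "INSERT INTO allocation (hotkey, details) VALUES (?, ?)", rows
        )
        conn.commit()
        return True
    except sqlite3.Error as exc:
        conn.rollback()  # undo any partially applied rows
        print(f"bulk insert failed: {exc}")
        return False
    finally:
        cur.close()
```

Because the rollback covers the whole executemany batch, a constraint violation on any single row leaves previously committed data intact.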

Sources: compute/utils/db.py:13-17, neurons/Validator/database/challenge.py:140-176, neurons/Validator/database/allocate.py:211-229