Processes
Programs and scripts that move data through the pipeline. One page per process.
Status: Phase 2 placeholder
Detailed pages will be added in Phase 2.
Planned entries:
- kvprocessor.cpp: Stage 1 ingestion (pulls raw data from remote CH hosts into keywords.keywords_data_local).
- processing.py: Stage 2 derivation (computes all processed_* columns). This is the heart of the pipeline.
- pe_update.py: Stage 3 PE tracking selection.
- upload_backend.sh + Slurm jobs: Stage 4 distribution to keywords_metrics_local and Elasticsearch.
- extract_gt.py / extract_gkp.py: refresh extraction (independent cadence, signal-driven).
- keep_running.sh: top-level orchestrator chaining stages 1-4 hourly under screen.
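Until the detailed pages exist, the hourly chaining of stages 1-4 can be pictured with a minimal Python sketch. It is not the contents of keep_running.sh. Only the script names come from the list above; the command lines, the kvprocessor binary name, the stop-on-first-failure behaviour, and the helper names run_cycle and seconds_until_next_run are assumptions made for illustration.

```python
import subprocess
import time
from typing import Callable, List, Sequence

# Hypothetical command lines. Only the script names are taken from the
# planned entries; interpreters, paths, and arguments are assumptions.
STAGES: List[List[str]] = [
    ["./kvprocessor"],              # Stage 1: ingestion (assumed binary name)
    ["python", "processing.py"],    # Stage 2: derivation
    ["python", "pe_update.py"],     # Stage 3: PE tracking selection
    ["bash", "upload_backend.sh"],  # Stage 4: distribution
]


def run_cycle(stages: Sequence[Sequence[str]],
              run: Callable[[Sequence[str]], int]) -> int:
    """Run stages in order and return how many completed successfully.

    Stops at the first non-zero exit code (an assumed policy), so a later
    stage never runs on the output of a failed earlier stage.
    """
    completed = 0
    for cmd in stages:
        if run(cmd) != 0:
            break
        completed += 1
    return completed


def seconds_until_next_run(started: float, finished: float,
                           interval: float = 3600.0) -> float:
    """Time to sleep so that cycles start roughly one interval apart."""
    return max(0.0, interval - (finished - started))


def main() -> None:
    """Loop forever, starting one cycle per hour."""
    while True:
        started = time.monotonic()
        run_cycle(STAGES, lambda cmd: subprocess.run(cmd).returncode)
        time.sleep(seconds_until_next_run(started, time.monotonic()))
```

Calling main() starts the loop. Passing the runner as a parameter keeps run_cycle testable without launching any real stage.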