name

coding-with-query-engine

description

Build and troubleshoot Mu2e Query Engine (QE) URL reads for PostgreSQL-backed conditions and operations data. Use when composing QE URLs, doing Python HTTP reads, or implementing C++ database fetches with Offline DbService DbReader/DbId helpers.

compatibility

Requires Mu2e Offline environment, network access to FNAL QE endpoints, and familiarity with Offline/DbService and conditions table naming

metadata

version	last-updated
1.0.1	2026-02-20

Coding with Query Engine

Overview

Use this skill for Mu2e code that reads PostgreSQL table data through Fermilab Query Engine (QE) over HTTP.

Primary usage modes:

Build QE query URLs and validate query semantics
Read data in Python via HTTP GET
Read data in C++ using Mu2e DbService wrappers

For Mu2e C++ applications, do not use the official QE C API directly. Use Mu2e wrappers centered on:

Offline/DbService/inc/DbReader.hh
Offline/DbService/inc/DbIdList.hh
Offline/DbTables/inc/DbId.hh
Offline/DbService/inc/DbCurl.hh

Reference docs:

QE URL Model (REST)

Base endpoint pattern:

<prefix>/query?dbname=<db>&t=<table>[&c=<cols>][&w=<cond>...][&o=<sort>][&l=<limit>][&f=<fmt>][&x=<cacheCtl>]

Core parameters:

dbname (required): logical database name
t (required): table name (schema.table when needed)
c (optional): comma-separated column list, default all columns
w (optional, repeatable): where terms, typically column:op:value
o (optional): sort columns, prefix - for descending
l (optional): row limit
f (optional): output format (csv default, text supported)
x (optional): cache control (yes, no, clear)

Comparison operators for w: lt, le, eq, ne, ge, gt.

Example semantics:

w=cid:eq:2&w=channel:le:1

means SQL-like AND conditions.

Cache vs No-Cache Behavior

Mu2e relies on two QE URL heads per database connection:

cache URL (...:8444/.../query?): uses web cache for scale
nocache URL (...:9443/.../query?): bypasses cache and reads DB directly

In Mu2e connection config (Offline/DbService/data/connections.txt), each database has both endpoints.

Important behavior:

Use cache for high-rate stable table reads
Use no-cache when latest DB entries must be discovered immediately
Non-cache reads bypass cache and do not refresh cached entries

C++ DbReader behavior details:

Non-val* tables use cache by default (setUseCache(true))
val* IoV metadata tables default to no-cache for freshness
setCacheLifetime(n) can provide bounded caching for val* queries via a rolling URL token

Mu2e Database Names (from connections)

Known logical database names in current config include:

mu2e_conditions_prd
mu2e_conditions_dev
mu2e_dqm_prd
dcs_archive
mu2e_tracker_prd
run_info

Always pass the logical name via dbname=<name>.

Standard Workflow

1) Start from a concrete query goal

Define precisely:

target logical database (dbname)
target table (t)
required columns (c)
filter predicates (w)
sort/limit (o, l)
freshness requirement (cache vs no-cache)

2) Validate URL shape with curl

curl -s "https://dbdata0vm.fnal.gov:8444/QE/mu2e/prod/app/SQ/query?dbname=mu2e_conditions_prd&t=val.tables&c=tid,name&o=tid" | head

Force direct DB read:

curl -s "https://dbdata0vm.fnal.gov:9443/QE/mu2e/prod/app/SQ/query?dbname=mu2e_conditions_prd&t=val.tables&c=tid,name&o=tid" | head

3) Implement in Python or C++

Python: compose URL with proper query encoding and repeat w
C++: use DbIdList + DbReader and pass select/table/where/order pieces

4) Confirm result shape and freshness

Verify header/content format (csv default)
Check row count and ordering
Re-run against cache and no-cache endpoints when freshness matters

Example: cached read for `run_info` / `test_sc.run`

Use this when you need enough detail to identify the run table definition.

Confirm tables in schema test_sc:

curl -s "https://dbdata0vm.fnal.gov:8444/QE/mu2e/prod/app/SQ/I/tables?dbname=run_info&ns=test_sc&f=json"

Get run table column definition (names/types):

curl -s "https://dbdata0vm.fnal.gov:8444/QE/mu2e/prod/app/SQ/I/columns?dbname=run_info&t=test_sc.run&f=json"

Optional: probe a few rows from the table using cache endpoint:

curl -s "https://dbdata0vm.fnal.gov:8444/QE/mu2e/prod/app/SQ/query?dbname=run_info&t=test_sc.run&c=*&l=5"

If the definition appears stale, compare against no-cache endpoint:

curl -s "https://dbdata0vm.fnal.gov:9443/QE/mu2e/prod/app/SQ/I/columns?dbname=run_info&t=test_sc.run&f=json"

Notes:

For table definition, prefer /I/columns over /query.
Use fully qualified table name test_sc.run.
f=json is easiest for parsing column metadata programmatically.
In current QE deployment, /I/tables?...&f=csv may return a server error; use f=json for schema introspection.

Python Pattern

Use URL parameter encoding rather than manual string concatenation.

from urllib.parse import urlencode
from urllib.request import urlopen

base = "https://dbdata0vm.fnal.gov:9443/QE/mu2e/prod/app/SQ/query?"
params = [
    ("dbname", "mu2e_conditions_prd"),
    ("t", "tst.calib1"),
    ("c", "flag,dtoe"),
    ("w", "cid:eq:2"),
    ("w", "channel:le:1"),
    ("o", "-flag"),
    ("f", "csv"),
]
url = base + urlencode(params)

with urlopen(url, timeout=30) as response:
    payload = response.read().decode("utf-8")

print(payload)

Guidelines:

Keep params as ordered tuples so repeated w keys are preserved
Use timeout and explicit error handling for production code
Prefer no-cache endpoint when checking for just-inserted rows

C++ Pattern (Mu2e wrappers)

Use DbIdList to resolve logical DB name to endpoint URLs, then query through DbReader.

#include "Offline/DbService/inc/DbIdList.hh"
#include "Offline/DbService/inc/DbReader.hh"

mu2e::DbIdList idList;
mu2e::DbId id = idList.getDbId("mu2e_conditions_prd");

mu2e::DbReader reader;
reader.setDbId(id);
reader.setUseCache(true);
reader.setAbortOnFail(false);

std::string csv;
mu2e::DbReader::StringVec where;
where.emplace_back("cid:eq:2");
where.emplace_back("channel:le:1");

int rc = reader.query(csv, "flag,dtoe", "tst.calib1", where, "-flag");
if (rc != 0) {
  auto err = reader.lastError();
}

Key API notes:

query(csv, select, table, where, order) maps directly to URL pieces
multiQuery(...) reuses socket/handle for multiple related reads
setAbortOnFail(false) lets caller handle failures instead of throwing
Returned CSV from DbReader has header line removed by default

Do not switch to QE official C API for Mu2e Offline code paths unless specifically required by external integration.

Query Composition Rules

Keep table and column names exact and lower-case where required by schema.
Build each w clause as field:op:value unless using implicit eq form.
Repeat w for AND logic; avoid embedding SQL text.
Use deterministic ordering for reproducible comparisons.
Prefer explicit columns over * in production code.
Separate freshness checks (no-cache) from steady-state reads (cache).

Troubleshooting

Empty or unexpected results

Verify dbname, schema/table name, and column names
Test same URL with no-cache endpoint
Remove filters incrementally, then re-add one by one

Intermittent failures or gateway errors

Retry with bounded backoff
For C++, inspect reader.lastError() and return code
Increase DbReader timeout only when justified

Fresh data not visible

Confirm you are using no-cache endpoint (or x=no if supported)
For Mu2e C++, disable cache with setUseCache(false) where appropriate
Re-check val* table behavior and any cache lifetime settings

Safety and Performance Guardrails

Keep queries narrow (c, w, l) to reduce payload and latency
Avoid repeated broad scans from grid jobs
Default to cached reads for stable data at scale
Use direct DB reads sparingly and intentionally
Log canonical URL components (without secrets) when debugging

Output Template

When asked to produce QE access code, return:

Goal:
- <what data is needed and why>

Chosen endpoint mode:
- cache|nocache
- reason: <freshness vs scale>

Query components:
- dbname: <...>
- table: <...>
- columns: <...>
- where:
  - <w1>
  - <w2>
- order: <...>
- limit: <...>

Implementation:
- Language: Python|C++
- Code: <minimal runnable snippet>

Validation:
- curl URL used:
- row-count or sample rows:
- cache-vs-nocache comparison result:

References

Query Engine
Mu2e ConditionsDbCode URL format
Offline/DbService/inc/DbReader.hh
Offline/DbService/src/DbReader.cc
Offline/DbService/inc/DbIdList.hh
Offline/DbService/src/DbIdList.cc
Offline/DbService/data/connections.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Coding with Query Engine

Overview

QE URL Model (REST)

Cache vs No-Cache Behavior

Mu2e Database Names (from connections)

Standard Workflow

1) Start from a concrete query goal

2) Validate URL shape with curl

3) Implement in Python or C++

4) Confirm result shape and freshness

Example: cached read for `run_info` / `test_sc.run`

Python Pattern

C++ Pattern (Mu2e wrappers)

Query Composition Rules

Troubleshooting

Empty or unexpected results

Intermittent failures or gateway errors

Fresh data not visible

Safety and Performance Guardrails

Output Template

References

FilesExpand file tree

SKILL.md

Latest commit

History

SKILL.md

File metadata and controls

Coding with Query Engine

Overview

QE URL Model (REST)

Cache vs No-Cache Behavior

Mu2e Database Names (from connections)

Standard Workflow

1) Start from a concrete query goal

2) Validate URL shape with curl

3) Implement in Python or C++

4) Confirm result shape and freshness

Example: cached read for run_info / test_sc.run

Python Pattern

C++ Pattern (Mu2e wrappers)

Query Composition Rules

Troubleshooting

Empty or unexpected results

Intermittent failures or gateway errors

Fresh data not visible

Safety and Performance Guardrails

Output Template

References

Example: cached read for `run_info` / `test_sc.run`