Hello Astro Data Lab community,
I have been doing some data analysis on the NSC DR2 catalog via ADQL, specifically filtering for deep-space candidates and tracking pipeline calculation errors. My goal was to isolate cross-epoch tracklet linking failures by targeting extreme kinematic flags.
I used a query filtering for parameters like pmraerr > 90.0 (or extreme proper motion values) combined with an observation gap of deltamjd > 30.0 and ndet >= 3.
When I extracted this dataset and processed the coordinates through a 3D Python projection, I noticed a very unusual statistical distribution. Instead of the pipeline noise and artifacts being scattered randomly across the surveyed sky, the rejected candidates (over 270,000 objects) cluster very tightly—within a 3.0° window—along highly specific, symmetrical geometric axes (forming a hexahedral lattice pattern). A Monte Carlo test adjusting for ecliptic survey bias suggests this clustering is highly non-random.
I have uploaded the specific ADQL query, the Python processing scripts, and the resulting CSV dataset to Zenodo for reproducibility:
https://zenodo.org/records/20178610
My question is regarding the data processing pipeline: What algorithmic or instrumental mechanisms could cause pipeline noise and tracklet failures to align in such a highly structured, non-uniform spatial pattern? Could this geometric clustering be a known artifact of the astrometric reduction pipeline, a systemic bias in how the catalog handles extreme cross-epoch deviations, or an anomaly tied to the underlying survey footprint/tessellation?
Any insights into the pipeline architecture that could explain this specific distribution of errors would be greatly appreciated!
SELECT
id, ra, dec, pmra, pmdec, pmraerr, pmdecerr,
gmag, rmag, imag, ndet, mjd, deltamjd, class_star, flags
FROM nsc_dr2.object
WHERE (
(ra BETWEEN 42.0 AND 48.0 AND dec BETWEEN -3.0 AND 3.0) OR
(ra BETWEEN 132.0 AND 138.0 AND dec BETWEEN -3.0 AND 3.0) OR
(ra BETWEEN 222.0 AND 228.0 AND dec BETWEEN -3.0 AND 3.0) OR
(ra BETWEEN 312.0 AND 318.0 AND dec BETWEEN -3.0 AND 3.0) OR
(ra BETWEEN 357.0 AND 360.0 AND dec BETWEEN 42.0 AND 48.0) OR
(ra BETWEEN 0.0 AND 3.0 AND dec BETWEEN 42.0 AND 48.0) OR
(ra BETWEEN 87.0 AND 93.0 AND dec BETWEEN 42.0 AND 48.0) OR
(ra BETWEEN 177.0 AND 183.0 AND dec BETWEEN 42.0 AND 48.0) OR
(ra BETWEEN 267.0 AND 273.0 AND dec BETWEEN 42.0 AND 48.0) OR
(ra BETWEEN 357.0 AND 360.0 AND dec BETWEEN -48.0 AND -42.0) OR
(ra BETWEEN 0.0 AND 3.0 AND dec BETWEEN -48.0 AND -42.0) OR
(ra BETWEEN 87.0 AND 93.0 AND dec BETWEEN -48.0 AND -42.0) OR
(ra BETWEEN 177.0 AND 183.0 AND dec BETWEEN -48.0 AND -42.0) OR
(ra BETWEEN 267.0 AND 273.0 AND dec BETWEEN -48.0 AND -42.0)
)
AND (pmraerr > 90.0 OR pmdecerr > 90.0 OR gmag > 90.0 OR (pmra*pmra + pmdec*pmdec) > 2500.0)
AND deltamjd > 30.0
AND ndet >= 3
Thanks, Victor