RecordingExtractors #171

pauladkisson · 2025-11-20T01:09:47Z

Recording Extractors Architecture

Fixes #170

Overview

This refactor replaces monolithic format detection and reading logic with a modular extractor architecture. Each data format (TDT, Doric, CSV, NPM) now has its own dedicated class implementing a common interface.

Benefits: Modularity, extensibility for new formats, consistent API, and isolated testability.

Architecture

classDiagram
    class BaseRecordingExtractor {
        <>
        +discover_events_and_flags()* tuple~list, list~
        +read(events, outputPath)* list~dict~
        +save(output_dicts, outputPath)* None
        #_write_hdf5(data, storename, output_path, key) None
    }
    
    class TdtRecordingExtractor
    class DoricRecordingExtractor
    class CsvRecordingExtractor
    class NpmRecordingExtractor
    
    BaseRecordingExtractor <|-- TdtRecordingExtractor
    BaseRecordingExtractor <|-- DoricRecordingExtractor
    BaseRecordingExtractor <|-- CsvRecordingExtractor
    CsvRecordingExtractor <|-- NpmRecordingExtractor

API Contract

All extractors implement three methods:

Method	Purpose
`discover_events_and_flags()`	Class method to find available events in data files
`read(*, events, outputPath)`	Extract data for specified events → returns list of dicts
`save(*, output_dicts, outputPath)`	Write extracted data to HDF5

Note: discover_events_and_flags() has a flexible signature—NPM requires additional num_ch and inputParameters arguments for channel configuration.

NPM Configuration Pattern: Tkinter GUI code has been moved out of the extractor and into saveStoresList.py. The extractor provides helper methods (has_multiple_event_ttls(), needs_ts_unit()) to determine what configuration is needed, while the GUI layer collects user input and passes it to discover_events_and_flags() via inputParameters. This keeps the extractor free of GUI dependencies.

Pipeline Integration

Step 2 (saveStoresList.py): Calls discover_events_and_flags() to find events, presents GUI for user to create friendly name mappings → outputs storesList.csv
Step 3 (readTevTsq.py): Creates appropriate extractor, reads storesList.csv for event list, processes all events in parallel via read_and_save_all_events() → outputs HDF5 files

Doric note: Uses storesList.csv to build the required event_name_to_event_type mapping.

Data Flow

flowchart TB
    A[Raw Data Files] --> B[Step 2: saveStoresList.py]
    B --> C[discover_events_and_flags]
    C --> D[GUI: User Maps Events]
    D --> E[storesList.csv]
    E --> F[Step 3: readTevTsq.py]
    A --> F
    F --> G[Create Extractor]
    G --> H[read_and_save_all_events]
    H --> I[HDF5 Files]

Co-authored-by: Copilot <[email protected]>

…ractor.

…xtractor.

…nd flags.

…nstead of properties.

…s into the base_recording_extractor and removed all duplicates.

pauladkisson added 30 commits November 17, 2025 10:59

Moved readtsq to tdt_step2.py.

aa4340d

Moved import_np_doric_csv to np_doric_csv_step2.py.

c868823

Split import_csv out from import_np_doric_csv

a06cae4

Fixed TDT

66d60e2

Split import_doric out from import_np_doric_csv

4f4e1c9

Removed unnecessary imports

341d77d

Split import_npm out from import_np_doric_csv

0bcd4fe

Added modality selector to the GUI.

7b36f64

Added modality selector to the GUI.

100ad14

Added modality option to the api and tests

ef978ec

Removed intermediate np_doric_csv_step2 module.

6589139

Split tdt_step3.py off from read_raw_data.py.

e7ac4d8

Hard-coded modality to simplify read.

2f57867

Split doric_step3.py off from read_raw_data.py.

092e1b7

Added check_doric to doric_step3.py.

7abb8e0

Split csv_step3.py off from read_raw_data.py.

b653538

Added modality to Step 3.

6d661c2

Added tdtRecordingExtractor

a4f6583

Adapted parallel execute function to use new extractor.

882556e

Added CsvRecordingExtractor for step 2

df7b9e1

Installed pre-commit.

bcb78a5

Added CsvRecordingExtractor for step 3

1c8ee07

Added DoricRecordingExtractor for step 2

9262a5a

Added DoricRecordingExtractor for step 2

9c5afce

Added DoricRecordingExtractor for step 3

914f23f

streamlined inputs

cd966ae

Added NpmRecordingExtractor for step 2

ac158de

Added NpmRecordingExtractor for step 3

6a470a1

Merge branch 'modularization' into extractor

4903817

Add a tdt_check_data example session to the tests.

9b88cad

pauladkisson and others added 23 commits November 21, 2025 12:37

Added high-level save

ddf6ae5

Added TODO

212c7c5

Added multi-processing back in.

33682d2

Fixed test_step5.py for tdt_check_data

f84c550

Fixed test_step4.py for tdt_check_data

c55a230

Renamed test_case from tdt_check_data to tdt_split_event.

03ffd54

Standardize read and save (#188)

27acc6c

Remove tkinter from NPM (#189)

a633550

Co-authored-by: Copilot <[email protected]>

Defined BaseRecordingExtractor.

d55bba7

Removed obsolete intermediates extractor steps

1689b7e

Refactored csv_recording_extractor to inherit from base_recording_ext…

b35e04b

…ractor.

Refactored tdt_recording_extractor to inherit from base_recording_ext…

b330a64

…ractor.

Updated parameter names for saveStoresList.

8af3b2b

Refactored npm_recording_extractor to inherit from base_recording_ext…

5dc6d78

…ractor.

Refactored doric_recording_extractor to inherit from base_recording_e…

861e991

…xtractor.

Refactored doric_recording_extractor to use class method for events a…

dd40cb4

…nd flags.

Refactored Extractors to use class method discover_events and flags i…

4619964

…nstead of properties.

Refactored Extractors to use class method discover_events and flags i…

beb585f

…nstead of properties.

Added comment about discover_events_and_flags signature

1b5e8ca

Removed unused quarks.

2e38ee8

Refactored NpmRecordingExtractor to inherit from CsvRecordingExtractor.

cdecf42

Updated TODO

d43670f

Centralized read_and_save_all_events and read_and_save_event function…

cd245a1

…s into the base_recording_extractor and removed all duplicates.

pauladkisson changed the base branch from modularization to dev December 4, 2025 02:04

Removed redundant intermediate common_step3.py.

7e69cc7

pauladkisson marked this pull request as ready for review December 4, 2025 02:33

pauladkisson requested a review from venus-sherathiya December 4, 2025 02:34

pauladkisson mentioned this pull request Dec 15, 2025

Modularize Analysis #190

Draft

6 tasks

pauladkisson merged commit b58994d into dev Dec 17, 2025
17 checks passed

pauladkisson deleted the extractor branch December 17, 2025 00:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RecordingExtractors #171

RecordingExtractors #171

Uh oh!

pauladkisson commented Nov 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RecordingExtractors #171

RecordingExtractors #171

Uh oh!

Conversation

pauladkisson commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Recording Extractors Architecture

Overview

Architecture

API Contract

Pipeline Integration

Data Flow

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pauladkisson commented Nov 20, 2025 •

edited

Loading