MIMIC IV/hosp/labevents, entry error or duplicated entries for same subject and same charttime? #1960

ahipSQL · 2025-12-26T09:43:12Z

I am using the labevents data for my study and found something weird. When I ran the following code in Python/pandas:

df.groupby(['subject_id', 'charttime', 'label']).filter(lambda x: len(x) > 1)

I found rows with the same subject_id, the same charttime, and the same itemid/label, but with different data entries.

For example, the rows at index 16407 and 16416 in the figure have the same subject_id, charttime, and itemid/label, but a different storetime, a different specimen_id, and a different value/valuenum. These are CSF data, and it is uncommon to perform a lumbar puncture twice within a short period, so I did not expect two results. What do these rows represent, and how should I handle them?
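To make the pattern easier to inspect, here is a minimal sketch of how I count rows and distinct specimens for each duplicated key. The rows below are made up for illustration and are not real MIMIC data. I am assuming the columns are named subject_id, charttime, itemid, specimen_id, and valuenum. The sketch uses `duplicated(keep=False)` instead of the groupby/filter above, which should select the same rows as long as the key columns contain no missing values.

```python
import pandas as pd

# Made-up rows standing in for labevents (illustrative values only).
df = pd.DataFrame({
    "subject_id": [1, 1, 1, 2],
    "charttime": pd.to_datetime([
        "2150-01-01 10:00", "2150-01-01 10:00",
        "2150-01-02 08:00", "2150-01-01 10:00",
    ]),
    "itemid": [100, 100, 100, 100],      # hypothetical item id
    "specimen_id": [11, 12, 13, 14],     # hypothetical specimen ids
    "valuenum": [5.0, 7.0, 6.0, 4.0],
})

keys = ["subject_id", "charttime", "itemid"]

# Keep every row whose key combination occurs more than once.
dupes = df[df.duplicated(subset=keys, keep=False)]

# For each duplicated key: how many rows, and how many distinct specimens?
summary = (
    dupes.groupby(keys)
    .agg(n_rows=("valuenum", "size"), n_specimens=("specimen_id", "nunique"))
    .reset_index()
)
print(summary)
```

If n_specimens equals n_rows for a key, the repeated rows come from different specimens rather than from one specimen recorded twice. That alone does not tell me which value to keep.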

Thanks. Merry Christmas and Happy Holidays!

Replies: 0 comments
