Fix inference bug when there are NULL in columns by hjk1030 · Pull Request #175 · ddkang/aidb

hjk1030 · 2024-04-15T14:28:38Z

The problem in issue #104 seems to be caused by having NaN in the dataframe, which is not possible to transformed into json. By replacing the NaN in the dataframe with empty string should fix the issue.

P.S. I cannot really reproduce the problem since the blank place in the given table are not really null, instead are wrongly recognized strings starts with character '='. However, after searching on stackoverflow I believe the fix should resolve the issue.

ddkang · 2024-04-15T14:36:01Z

@ttt-77 how come the test case doesn't reflect the issue?

ttt-77 · 2024-04-16T03:32:04Z

I removed the rows containing NULL values for previous tests. Can you use the data file from the provided link to see if you can reproduce the error? To expedite the process, you can retain only a few normal rows and all abnormal rows initially. @hjk1030

https://drive.google.com/file/d/19lbMHnAPVs41iHlZXukRT6j2jUvJ7se8/view?usp=sharing

hjk1030 · 2024-04-16T16:57:39Z

I still can't reproduce the same error. It seems that the program would abort at a type check before the request is sent(though the fix works for that). Could you provide the test script that the error happened?

ttt-77 · 2024-04-16T21:36:33Z

It appears that JSON now allows 'None' values, so this is no longer an issue. However, rows containing 'None' values will be dropped. Could you check if we can remove '.dropna()' from the following code? @continue-revolution

aidb/tests/inference_service_utils/http_inference_service_setup.py

Line 66 in 5bb4477

group = group.drop(columns=name_to_input_cols[service_name]).dropna()

continue-revolution · 2024-04-24T19:51:25Z

I think it's fine to remove "dropna()"

hjk1030 · 2024-04-25T15:41:51Z

I believe dropna is still needed as there are lines containing only null values corresponding to no outputs. I changed the parameters to dropping only these lines and it seems to pass the test for now.

Fix: nan is not json compliant

fe97b32

ddkang requested review from continue-revolution and ttt-77 April 15, 2024 14:35

hjk1030 and others added 4 commits April 25, 2024 09:18

Removing dropna and previous changes

28f5e67

Merge branch 'ddkang:main' into null_in_csv

fdaabff

Adding back the errorly removed drop function

df86090

Drop the all null lines corresponds to no output

bce76bf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix inference bug when there are NULL in columns#175

Fix inference bug when there are NULL in columns#175
hjk1030 wants to merge 5 commits intoddkang:mainfrom
hjk1030:null_in_csv

hjk1030 commented Apr 15, 2024

Uh oh!

ddkang commented Apr 15, 2024

Uh oh!

ttt-77 commented Apr 16, 2024

Uh oh!

hjk1030 commented Apr 16, 2024

Uh oh!

ttt-77 commented Apr 16, 2024

Uh oh!

continue-revolution commented Apr 24, 2024

Uh oh!

hjk1030 commented Apr 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

hjk1030 commented Apr 15, 2024

Uh oh!

ddkang commented Apr 15, 2024

Uh oh!

ttt-77 commented Apr 16, 2024

Uh oh!

hjk1030 commented Apr 16, 2024

Uh oh!

ttt-77 commented Apr 16, 2024

Uh oh!

continue-revolution commented Apr 24, 2024

Uh oh!

hjk1030 commented Apr 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants