
Notify user if two datasets with different hashes are compared#219

Merged
paulmueller merged 10 commits into DC-analysis:master from RaghavaAlajangi:dcnum_hash_notification
Jul 30, 2025

Conversation

@RaghavaAlajangi
Member

@RaghavaAlajangi RaghavaAlajangi commented Jul 18, 2025

This PR aims to implement the feature mentioned in issue #217

  • implement logic
  • run tests locally
  • CICD passed
  • update CHANGELOG after review

@RaghavaAlajangi
Member Author

RaghavaAlajangi commented Jul 18, 2025

Case-1:

  • If datasets with different pipeline hashes are compared, labels such as Pipeline type A, Pipeline type B, and so on are shown.
[screenshot]

Case-2:

  • If datasets with the same pipeline hash are compared, no labels are shown.
[screenshot]

@codecov

codecov bot commented Jul 18, 2025

Codecov Report

❌ Patch coverage is 89.65517% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 78.48%. Comparing base (22ae7ff) to head (7c0fbf3).
⚠️ Report is 1 commit behind head on master.

Files with missing lines Patch % Lines
shapeout2/gui/pipeline_plot.py 89.65% 3 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #219      +/-   ##
==========================================
+ Coverage   78.45%   78.48%   +0.03%     
==========================================
  Files          67       67              
  Lines        7629     7654      +25     
==========================================
+ Hits         5985     6007      +22     
- Misses       1644     1647       +3     

☔ View full report in Codecov by Sentry.

@paulmueller
Member

paulmueller commented Jul 21, 2025

Thanks! It might not be clear to the user what the "Type" label means. How about "Pipeline HASH", where HASH is the first four characters of the analysis pipeline hash? The label should be displayed in each plot when the hash differs in at least one of them.

@RaghavaAlajangi
Member Author

RaghavaAlajangi commented Jul 21, 2025

As you can see from the above plots, I changed this to Pipeline type A, Pipeline type B, and so on. Does it work, or should I replace letters with a hash?

@paulmueller
Member

I would say the word "type" is incorrect (there is no pipeline type, only different pipeline parameters), and "A" and "B" are too generic. Think about the case where you have two plots, each of them containing data from unique pipelines, but Shape-Out displays them as "A" and "B". Then a user might assume that "A" is always "A" and "B" is always "B", and we are again at the apples-vs-oranges comparison. Use "Pipeline HASH". In cases where the first four characters of two different pipelines match, more characters should be appended to the displayed hash.

@RaghavaAlajangi
Member Author

Hi @paulmueller
Can you review these changes?

def get_hash_flag(hash_set, rtdc_ds):
    """Helper function to determine the hash flag based on the dataset
    and hash set."""
    short_hash_set = set(h[:4] if h is not None else None for h in hash_set)
Member


The hash length should be dynamic in all cases. I.e. if the first 4 characters of two hashes are identical, then the hash length should be 5, but if the first 5 characters are identical, then the hash length should be 6 etc. It is very unlikely to happen, but it can happen at some point.

There might be a smart way of achieving this with list comprehensions, but a simple for-loop over the length of the longest hash (incrementing req_hash_len and generating short_hash_set) with the list/set comprehension you proposed would be good enough.

BTW this is a good design, putting the logic of whether to show the text and what text to show in one single method 👍
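
The suggested approach could be sketched as follows. This is a minimal, hypothetical illustration of the dynamic-length loop (function and variable names follow the review comment, not necessarily the merged code):

```python
def get_hash_flag(hash_set):
    """Return a label per hash, using the shortest prefix length (>= 4)
    that keeps all non-None hashes distinct; None if no hash is known."""
    hashes = [h for h in hash_set if h is not None]
    if not hashes:
        return None
    max_len = max(len(h) for h in hashes)
    req_hash_len = 4
    # Simple for-loop style search: grow the prefix until all short
    # hashes are unique (or the full hash length is reached).
    while req_hash_len < max_len:
        short_hash_set = {h[:req_hash_len] for h in hashes}
        if len(short_hash_set) == len(hashes):
            break
        req_hash_len += 1
    return {h: f"Pipeline {h[:req_hash_len]}" for h in hashes}
```

For example, two hashes sharing the prefix "abcd" would be labeled with five characters ("Pipeline abcd1" vs. "Pipeline abcd5") instead of colliding at four.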

@RaghavaAlajangi
Member Author

Hi Paul,
please have a look at these changes.



def test_get_hash_flag():
    rtdc_paths = datapath.glob("*.rtdc")
Member


Please add an assert rtdc_paths to make sure this test does not get skipped in case the data directory changes.
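
One detail worth keeping in mind when adding that assert: `Path.glob` returns a generator, which is truthy even when it matches nothing, so the guard only works once the paths are materialized into a list. A minimal illustration (not the project's test code):

```python
import pathlib
import tempfile

# Empty directory stands in for a data directory with no .rtdc files.
with tempfile.TemporaryDirectory() as tmp:
    datapath = pathlib.Path(tmp)
    gen = datapath.glob("*.rtdc")                 # generator: always truthy
    rtdc_paths = sorted(datapath.glob("*.rtdc"))  # list: falsy when empty

assert bool(gen) is True   # asserting on the raw generator never fails
assert rtdc_paths == []    # asserting on the list catches missing files
```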

Member

@paulmueller paulmueller left a comment


This looks good 👍 . To make things air tight, please add this to the testing code:

The test for get_hash_flag is very generic and does not explicitly check some of the cases.

Please add two more tests with the corresponding .rtdc files:

  1. hash_set only contains None -> get_hash_flag returns None
  2. hash_set contains at least one hash -> get_hash_flag returns "Pipeline HASH".

These explicit tests will help avoid regressions in the code.

@RaghavaAlajangi
Member Author

Hi @paulmueller
Please look at these changes now.

Member

@paulmueller paulmueller left a comment


❤️

@paulmueller paulmueller merged commit ab8887d into DC-analysis:master Jul 30, 2025
5 checks passed
RaghavaAlajangi added a commit to RaghavaAlajangi/DCscope that referenced this pull request Jul 30, 2025