Khanin/merged pool emb opt #4977

kudomcho · 2025-10-06T18:49:34Z

Implemented pitch-size calculation on tensor allocation for better memory alignment.
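The description does not say how the pitch is computed. A common approach, assumed here, is to round each row's byte length up to the next multiple of an alignment boundary. The sketch below illustrates that arithmetic only; the helper names and the 256-byte default are hypothetical and are not taken from this PR.

```python
def pitched_row_bytes(cols: int, elem_size: int, alignment: int = 256) -> int:
    """Round one row's byte length up to the next multiple of `alignment`.

    `alignment` defaults to 256 bytes purely as an illustrative assumption.
    """
    if cols <= 0 or elem_size <= 0 or alignment <= 0:
        raise ValueError("cols, elem_size and alignment must be positive")
    if alignment % elem_size != 0:
        raise ValueError("alignment must be a multiple of elem_size")
    row_bytes = cols * elem_size
    # Integer ceiling division, then scale back up to a byte count.
    return (row_bytes + alignment - 1) // alignment * alignment


def pitched_row_elems(cols: int, elem_size: int, alignment: int = 256) -> int:
    """Row stride in elements (logical columns plus padding)."""
    return pitched_row_bytes(cols, elem_size, alignment) // elem_size


if __name__ == "__main__":
    # A row of 20 float32 values (80 bytes) is padded to 256 bytes.
    print(pitched_row_elems(20, 4))  # prints 64
```

Under these assumptions, a `(10, 20)` float32 tensor would be backed by rows of 64 elements, of which only the first 20 in each row carry data.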

netlify · 2025-10-06T18:49:39Z

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
🔨 Latest commit	`6ce50d4`
🔍 Latest deploy log	https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68e542ed1072590008b27a3e
😎 Deploy Preview	https://deploy-preview-4977--pytorch-fbgemm-docs.netlify.app

meta-codesync · 2025-10-06T18:55:19Z

@q10 has imported this pull request. If you are a Meta employee, you can view this in D83994225.

q10 · 2025-10-06T19:04:28Z

fbgemm_gpu/test/merge_pooled_embeddings_test.py

        with torch.cuda.device(dst_device):
-            inputs = [torch.randn(10, 20) for _ in range(num_inputs)]
+            pitch = True
+            if pitch:


@kudomcho This logic is strange: we set `pitch` and then immediately check its value.

@q10 This is there to check the `allclose` assertion when pitching is enabled. It is currently forced to `True` so that the all-to-one test exercises the pitched path. Would you prefer that the pitch condition be passed in as a test argument?

Yes, could you make this an argument to the test method and use Hypothesis `@given(...)` to pass the selection in?

I added `use_pitched` to the `@given` on `test_all_to_one_device`, and modified `merged_embeddings_benchmark.py` to do the pitch calculation when `--use_pitched` is passed.
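The parameterization agreed on in this thread can be sketched as follows. This is not the PR's test: plain Python lists stand in for tensors so the example runs without a GPU, and `row_stride_elems`, `ALIGNMENT_BYTES`, and the `_sketch` test name are hypothetical. It only shows `st.booleans()` driving a `use_pitched` argument, with the logical contents expected to match whether or not rows are padded.

```python
from hypothesis import given, settings, strategies as st

ALIGNMENT_BYTES = 256  # assumed alignment, not taken from the PR
ELEM_SIZE = 4          # bytes per element, as for float32


def row_stride_elems(cols: int, use_pitched: bool) -> int:
    """Row stride in elements; padded to the alignment when pitched."""
    row_bytes = cols * ELEM_SIZE
    if use_pitched:
        row_bytes = -(-row_bytes // ALIGNMENT_BYTES) * ALIGNMENT_BYTES
    return row_bytes // ELEM_SIZE


@settings(max_examples=50, deadline=None)
@given(
    rows=st.integers(min_value=1, max_value=16),
    cols=st.integers(min_value=1, max_value=256),
    use_pitched=st.booleans(),
)
def test_all_to_one_device_sketch(rows: int, cols: int, use_pitched: bool) -> None:
    stride = row_stride_elems(cols, use_pitched)
    buf = [0.0] * (rows * stride)  # flat backing store, padding included

    expected = [[float(r * cols + c) for c in range(cols)] for r in range(rows)]
    for r in range(rows):
        # Write only the logical columns of each row.
        buf[r * stride : r * stride + cols] = expected[r]

    # Reading back the logical columns must ignore any padding.
    got = [buf[r * stride : r * stride + cols] for r in range(rows)]
    assert got == expected
    assert stride >= cols
    if use_pitched:
        assert (stride * ELEM_SIZE) % ALIGNMENT_BYTES == 0
```

Letting Hypothesis choose `use_pitched` covers both code paths in one test method, which avoids the hard-coded `pitch = True` flagged in the review.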


Summary:
X-link: facebookresearch/FBGEMM#1998

Implemented pitch size on tensor allocation for better memory alignment.

Differential Revision: D83994225

Pulled By: q10

fixed the correctness of merged pool embedding

3a36197

meta-cla bot added the cla signed label Oct 6, 2025

q10 reviewed Oct 6, 2025


kudomcho force-pushed the khanin/merged_pool_emb_opt branch from bd261b7 to 3a36197 Compare October 7, 2025 16:17

ufmt formatted, added use_pitch on given at unit test, and added pitc…

6ce50d4

…h calculation for better memory alignment on merged_embeddings_benchmark.py
