[hyperactor] channel-level ping-pong benchmarks #906

Open

mariusae wants to merge 5 commits into gh/mariusae/41/base from gh/mariusae/41/head

+182 −4

Member

mariusae commented Aug 18, 2025 •

Stack from ghstack (oldest at bottom):

This is an attempt at an apples-to-apples comparison with P1903314366 that eliminates any non-channel-related overheads.
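For readers unfamiliar with the pattern, a ping-pong benchmark bounces one message back and forth and times the round trips. The sketch below is only an illustration of that shape: it uses `std::sync::mpsc` and threads rather than hyperactor channels, and the names `ping_pong`, `rounds`, and `payload_len` are hypothetical, not taken from this PR.

```rust
use std::sync::mpsc;
use std::thread;
use std::time::{Duration, Instant};

/// Runs `rounds` ping-pong exchanges of a `payload_len`-byte message between
/// two threads. Returns the elapsed time and the total bytes echoed back.
/// Assumption: std channels stand in for the real channel implementation.
fn ping_pong(rounds: usize, payload_len: usize) -> (Duration, usize) {
    let (ping_tx, ping_rx) = mpsc::channel::<Vec<u8>>();
    let (pong_tx, pong_rx) = mpsc::channel::<Vec<u8>>();

    // Echo side: return every message unchanged until the sender hangs up.
    let echo = thread::spawn(move || {
        for msg in ping_rx {
            if pong_tx.send(msg).is_err() {
                break;
            }
        }
    });

    let mut msg = vec![0u8; payload_len];
    let mut bytes_echoed = 0;
    let start = Instant::now();
    for _ in 0..rounds {
        // The buffer is moved through the channel, so no payload copy happens here.
        ping_tx.send(msg).expect("echo thread exited early");
        msg = pong_rx.recv().expect("echo thread exited early");
        bytes_echoed += msg.len();
    }
    let elapsed = start.elapsed();

    drop(ping_tx); // Closing the channel ends the echo loop.
    echo.join().expect("echo thread panicked");
    (elapsed, bytes_echoed)
}

fn main() {
    let rounds = 10_000;
    let (elapsed, bytes) = ping_pong(rounds, 1024);
    let per_round_us = elapsed.as_secs_f64() / rounds as f64 * 1e6;
    println!("{bytes} bytes echoed, {per_round_us:.2} us per round trip");
}
```

Because only the channel sits between the two sides, whatever time this reports is attributable to the channel itself, which is the kind of isolation the comparison above is after.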

The results replicate previous findings: our throughput is hampered by excess data copies (either outright or through growing buffers in the encoding stack) and by tokio-level network I/O overheads. Both are being addressed. These benchmarks should help validate that work as it lands.
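To make the "growing buffers" point concrete, here is a minimal sketch, not the encoding stack from this PR, that counts how often a `Vec<u8>` changes capacity while chunks are appended. Each capacity change is a reallocation that may copy every byte written so far. The helper `append_counting_reallocs` and the chunk sizes are hypothetical.

```rust
/// Appends each chunk to `buf` and counts how many times the buffer's
/// capacity changed, i.e. how many reallocations the appends triggered.
fn append_counting_reallocs(mut buf: Vec<u8>, chunks: &[&[u8]]) -> (Vec<u8>, usize) {
    let mut reallocs = 0;
    for chunk in chunks {
        let before = buf.capacity();
        buf.extend_from_slice(chunk);
        if buf.capacity() != before {
            reallocs += 1;
        }
    }
    (buf, reallocs)
}

fn main() {
    let chunk = [0u8; 4096];
    let chunks: Vec<&[u8]> = vec![&chunk[..]; 64];
    let total: usize = chunks.iter().map(|c| c.len()).sum();

    // Starting empty forces the buffer to grow as data arrives.
    let (_, grown) = append_counting_reallocs(Vec::new(), &chunks);
    // Sizing the buffer up front avoids every one of those reallocations.
    let (_, presized) = append_counting_reallocs(Vec::with_capacity(total), &chunks);

    println!("reallocations: grown={grown}, presized={presized}");
}
```

The exact number of reallocations in the growing case depends on the allocator's growth policy, so the sketch prints it rather than assuming a value; the presized case is the one with a guaranteed result of zero.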

Differential Revision: D80260732


          [hyperactor] channel-level ping-pong benchmarks

9153a3e

Differential Revision: [D80260732](https://our.internmc.facebook.com/intern/diff/D80260732/)

[ghstack-poisoned]

This was referenced Aug 14, 2025

[serde_multipart] new crate: multipart codec for serde #831

Open

[hyperactor] Serialized: permit multiple encoding representations #832

Open

[serde_multipart] serializable Message #852

Open

[serde_multipart] framed multipart encoding #876

Open

[hyperactor] Serialized: multipart support #905

Open

[hyperactor] net: zero copy framer #907

Open

meta-cla bot added the CLA Signed label

Contributor

facebook-github-bot commented Aug 18, 2025

This pull request was exported from Phabricator. Differential Revision: D80260732

facebook-github-bot added the fb-exported label


          Update on "[hyperactor] channel-level ping-pong benchmarks"

98e1a3a


          Update on "[hyperactor] channel-level ping-pong benchmarks"

d8915b2


          Update on "[hyperactor] channel-level ping-pong benchmarks"

29704b8

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

e56d868

ghstack-source-id: 303773788
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

145e288

ghstack-source-id: 303773788
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732


          Update on "[hyperactor] channel-level ping-pong benchmarks"

b5e749f

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

a0f38e5

Summary:
Pull Request resolved: meta-pytorch#906

ghstack-source-id: 303699220
exported-using-ghexport

Differential Revision: D80260732

Reviewed By: highker, vidhyav

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

8a36ec5

Summary:
Pull Request resolved: meta-pytorch#906

ghstack-source-id: 303699220
exported-using-ghexport

Differential Revision: D80260732

Reviewed By: highker, vidhyav

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

425bb37

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

76391bf

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

80229e4

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

f2f0116

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

8f8cca8

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

c69242c

Summary:
Pull Request resolved: meta-pytorch#906

ghstack-source-id: 303699220
exported-using-ghexport

Differential Revision: D80260732

Reviewed By: highker, vidhyav

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

c27c68a

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

50a23bd

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

64caeea

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

c16a517

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

shayne-fletcher pushed a commit to shayne-fletcher/monarch-1 that referenced this pull request


          channel-level ping-pong benchmarks (meta-pytorch#906)

6e5af83

ghstack-source-id: 303814001
exported-using-ghexport

Reviewed By: highker, vidhyav

Differential Revision: D80260732

Labels

CLA Signed fb-exported