This repository was archived by the owner on Jan 21, 2025. It is now read-only.

Commit 2b05627

Dustin Tran authored and Copybara-Service committed
Add paper to README.md.
PiperOrigin-RevId: 220510085
1 parent f3c7bc6 commit 2b05627

1 file changed: +16 -0 lines changed

README.md

Lines changed: 16 additions & 0 deletions
@@ -389,3 +389,19 @@ python examples/toy_model_tpu.py \
 --mesh_shape='all:8' \
 --layout='hidden:all'
 ```
+
+## References
+
+> N. Shazeer, Y. Cheng, N. Parmar, D. Tran, A. Vaswani, P. Koanantakool,
+> P. Hawkins, H. Lee, M. Hong, C. Young, R. Sepassi, and B. Hechtman.
+> [Mesh-TensorFlow: Deep learning for supercomputers.](https://arxiv.org/abs/1811.02084)
+> In _Neural Information Processing Systems_, 2018.
+
+```none
+@inproceedings{shazeer2018mesh,
+author = {Noam Shazeer and Youlong Cheng and Niki Parmar and Dustin Tran and Ashish Vaswani and Penporn Koanantakool and Peter Hawkins and HyoukJoong Lee and Mingsheng Hong and Cliff Young and Ryan Sepassi and Blake Hechtman},
+title = {{Mesh-TensorFlow}: Deep Learning for Supercomputers},
+booktitle = {Neural Information Processing Systems},
+year = {2018},
+}
+```
