Skip to content

Commit 8ad095c

Browse files
authored
Update README.md
1 parent f50f5a9 commit 8ad095c

File tree

1 file changed

+5
-1
lines changed

1 file changed

+5
-1
lines changed

README.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,5 @@
1-
# the-stack-v2
1+
# The Stack v2 & StarCoder2Data
2+
3+
In this repository you can find the code for building The Stack v2 dataset, as well as the extra sources used to make StarCoder2data: the training corpus of the StarCoder2 family of models.
4+
5+
This reposirory is a follow-up of on the work in [bigcode-dataset](https://github.com/bigcode-project/bigcode-dataset/) used for [The Stack v1](https://huggingface.co/datasets/bigcode/the-stack) and [StarCoderData](https://huggingface.co/datasets/bigcode/starcoderdata).

0 commit comments

Comments
 (0)