@@ -11,22 +11,67 @@ <h3>Client Downloads</h3>
1111 we no longer provide client tools to submit data.</ p >
1212 < p > If you are interested in computing acoustic features on your own music, you can still download the command-line essentia extractor and run it yourself:</ p >
1313 < ul >
14- < li > < a href ="http ://ftp.acousticbrainz .org/pub/acousticbrainz/essentia-extractor-v2.1_beta2-linux-i686.tar.gz "> linux i386 extractor static binary</ a > </ li >
15- < li > < a href ="http ://ftp.acousticbrainz .org/pub/acousticbrainz /essentia-extractor-v2.1_beta2-linux-x86_64.tar.gz "> linux x86_64 extractor static binary</ a > </ li >
16- < li > < a href ="http ://ftp.acousticbrainz .org/pub/acousticbrainz/essentia-extractor-v2.1_beta2-2-gbb40004-osx.tar.gz "> mac 64 bit extractor static binary for 10.7 and higher</ a > </ li >
17- < li > < a href ="http ://ftp.acousticbrainz .org/pub/acousticbrainz/essentia-extractor-v2.1_beta2-1-ge3940c0-win-i686.zip "> win 32 bit extractor static binary</ a > </ li >
14+ < li > < a href ="https ://data.metabrainz .org/pub/musicbrainz/ acousticbrainz/extractors /essentia-extractor-v2.1_beta2-linux-i686.tar.gz "> linux i386 extractor static binary</ a > </ li >
15+ < li > < a href ="https ://data.metabrainz .org/pub/musicbrainz/acousticbrainzextractors/ /essentia-extractor-v2.1_beta2-linux-x86_64.tar.gz "> linux x86_64 extractor static binary</ a > </ li >
16+ < li > < a href ="https ://data.metabrainz .org/pub/musicbrainz/ acousticbrainz/extractors /essentia-extractor-v2.1_beta2-2-gbb40004-osx.tar.gz "> mac 64 bit extractor static binary for 10.7 and higher</ a > </ li >
17+ < li > < a href ="https ://data.metabrainz .org/pub/musicbrainz/ acousticbrainz/extractors /essentia-extractor-v2.1_beta2-1-ge3940c0-win-i686.zip "> win 32 bit extractor static binary</ a > </ li >
1818 </ ul >
19- < p > < a href ="http ://ftp.acousticbrainz .org/pub/acousticbrainz/sha1sum "> SHA1 sums</ a > </ p >
19+ < p > < a href ="https ://data.metabrainz .org/pub/musicbrainz/ acousticbrainz/extractors /sha1sum "> SHA1 sums</ a > </ p >
2020 < p > Newer versions of the essentia extractor are also available on the
2121 < a href ="https://mtg.github.io/essentia-labs/news/2015/12/22/static-binaries-for-extractors/ "> essentia website</ a > </ p >
2222 </ div >
2323
2424 < div class ="col-lg-6 ">
2525 < h3 > Data Downloads</ h3 >
26- < p > 2022-06-20: We are in the process of finalising data dumps of the complete AcousticBrainz database.
27- These dumps will be announced here and
26+ < p > 2022-07-06: We provide downloadable archives of all submissions made to AcousticBrainz (29,460,584 submissions)</ p >
27+ < h4 > Low-level and High-level json dumps</ h4 >
28+ < p > < a href ="https://data.metabrainz.org/pub/musicbrainz/acousticbrainz/dumps/acousticbrainz-highlevel-json-20220623/ "> High-level downloads</ a > < br >
29+ < a href ="https://data.metabrainz.org/pub/musicbrainz/acousticbrainz/dumps/acousticbrainz-lowlevel-json-20220623/ "> Low-level downloads</ a >
30+ </ p >
31+ < p >
32+ Dumps are split into 30 archives, each with 1 million data files. Archives are compressed with
33+ < a href ="https://facebook.github.io/zstd/ "> zstandard</ a > compression. Filenames inside the archives are structured
34+ such that they will all uncompress into the same location.
35+ </ p >
36+ < p >
37+ Files in each archive are named according to the following structure:
38+ < div class ="well "> < code > type/mb/i/mbid-n.json</ code > </ div >
39+ Where < code > type</ code > is one of < i > lowlevel</ i > or < i > highlevel</ i > ,
40+ < code > mbid</ code > is a uuid of a MusicBrainz Recording Identifier, < code > m</ code > ,
41+ < code > b</ code > , < code > i</ code > and < code > d</ code > are the first, second,
42+ third and fourth characters of the MusicBrainz
43+ Identifier, and < code > n</ code > indicates the ordinal submission offset of duplicate
44+ data files present for the same MusicBrainz Identifier. There will always
45+ be a file with submission offset < i > 0</ i > .
46+ </ p >
47+ < p >
48+ The format of the json files in each archive are described in the < a href ="{{ url_for('data.data') }} "> data page</ a > .
49+ </ p >
50+ < h4 > Sample json dumps</ h4 >
51+ < p > < a href ="https://data.metabrainz.org/pub/musicbrainz/acousticbrainz/dumps/acousticbrainz-sample-json-20220623/ "> Sample downloads</ a > </ p >
52+ < p > The same as the above full dumps, but only containing 100,000 items for small-scale testing.</ p >
53+ < h4 > Low-level feature dumps</ h4 >
54+ < p > < a href ="https://data.metabrainz.org/pub/musicbrainz/acousticbrainz/dumps/acousticbrainz-lowlevel-features-20220623/ "> Feature downloads</ a > </ p >
55+ < p > Smaller CSV files containing some basic features that may be useful for some tasks. Split into three different files based on feature type.
56+ Each file contains 29,460,584 rows of data.
57+ < ul >
58+ < li > < strong > lowlevel:</ strong > average_loudness, dynamic_complexity, mfcc_zero_mean</ li >
59+ < li > < strong > rhythm:</ strong > bpm, bpm_histogram_first_peak_bpm_mean, bpm_histogram_first_peak_bpm_median,
60+ bpm_histogram_second_peak_bpm_mean, bpm_histogram_second_peak_bpm_median,
61+ danceability, onset_rate</ li >
62+ < li > < strong > tonal:</ strong > key_key, key_scale, tuning_frequency, tuning_equal_tempered_deviation</ li >
63+ </ ul >
64+ See the < a href ="https://essentia.upf.edu/streaming_extractor_music.html "> essentia documentation for streaming_extractor_music</ a > for
65+ a description of what each of these features are.</ p >
66+ < h4 > Pending: Data files for acoustic similarity</ h4 >
67+ < p > 2022-07-06: We will provide a downloadable archive of the data files used in the
68+ < a href ="https://acousticbrainz.readthedocs.io/similarity.html "> recording similarity API</ a > .</ p >
69+ < h4 > Pending: Low-level and High-level dump of deduplicated items</ h4 >
70+ < p > 2022-07-06: We will provide new json and feature dumps of the database after de-duplicating to only one instance of each recording MBID
71+ (approximately 7 million items)</ p >
72+ < p > Pending dumps will be announced here and
2873 < a href ="https://community.metabrainz.org/c/acousticbrainz/5 "> on the AcousticBrainz forum</ a > in the coming weeks.
29- </ p >
74+ </ p >
3075 </ div >
3176
3277 </ div >
0 commit comments