If you make use of GuitarSet for academic purposes, please cite the following publication:
Q. Xi, R. Bittner, J. Pauwels, X. Ye, and J. P. Bello, "Guitarset: A Dataset for Guitar Transcription", in 19th International Society for Music Information Retrieval Conference, Paris, France, Sept. 2018. This project was lead by Qingyang Xi at NYU's Music and Audio Research Lab, along with Rachel Bittner, Xuzhou Ye and Juan Pablo Bello from the same lab, as well as Johan Pauwels at the Center for Digital Music at Queen Mary University of London. |
Introducing GuitarSet, a dataset that provides high quality guitar recordings alongside rich annotations and metadata.
In particular, by recording guitars using a hexaphonic pickup, we are able to not only provide recordings of the individual strings but also to largely automate the expensive annotation process, therefore providing rich annotation.
The dataset contains recordings of a variety of musical excerpts played on an acoustic guitar, along with time-aligned annotations including pitch contours, string and fret positions, chords, beats, downbeats, and playing style.
In particular, by recording guitars using a hexaphonic pickup, we are able to not only provide recordings of the individual strings but also to largely automate the expensive annotation process, therefore providing rich annotation.
The dataset contains recordings of a variety of musical excerpts played on an acoustic guitar, along with time-aligned annotations including pitch contours, string and fret positions, chords, beats, downbeats, and playing style.
Audio ContentGuitarSet contains 360 excerpts that are close to 30 seconds in length. The 360 excerpts are the result of the following combinations:
- 6 players each plays the same 30 lead sheets in...
- 2 versions: comping and soloing, where the player record the comping version first and later solo on top of his own comping. The 30 lead sheets are generated from a combinations of
- 5 styles: Rock, Singer-Songwriter, Bossa Nova, Jazz, and Funk - 3 Progressions: 12 Bar Blues, Autumn Leaves, and Pachelbel Canon. - 2 Tempi: slow and fast. |
Audio Collection SetupAudio are recorded with the help of a hexaphonic pickup, which outputs signals for each string separately, allowing automated note-level annotation.
Players are provided with lead sheets and backing tracks reflecting the correct style that includes a drum kit and a bass line.
Excerpts are recorded with both the hexaphonic pickup and a Neumann U-87 condenser microphone as reference.
3 audio recordings are provided with each excerpt with the following suffix: - hex: original 6 channel wave file from hexaphonic pickup - hex_cln: hex wave files with interference removal applied - mic: monophonic recording from reference microphone |
Annotation ContentEach of the 360 excerpts has an accompanying .jams file that stores 16 annotations.
Pitch:
- 6 pitch_contour annotations (1 per string) - 6 midi_note annotations (1 per string) Beat and Tempo: - 1 beat_position annotation - 1 tempo annotation Chords - 2 chord annotations (instructed and performed)* *The instructed chord annotation is a digital version of the lead sheet that's provided to the player, and the performed chord annotations are inferred from note annotations, using segmentation and root from the digital lead sheet annotation.
|