Each data sample has 5 data files:
{i}_emg.npy - a saved numpy array of shape (T, 8) with the raw EMG signals
{i}_audio.flac - the raw audio recording
{i}_audio_clean.flac - audio with background noise removed
{i}_info.json - a JSON file with extra information, such as the text prompt that was read
{i}_button.npy - a numpy array containing device button state, which is generally unused
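
As a rough illustration of the layout above, the following sketch loads the EMG array and the info file for one sample and returns the remaining files as paths. The function name `load_sample` and the returned dictionary keys are hypothetical, not part of the dataset. Audio decoding is left out on purpose, since reading FLAC needs a third-party audio library.

```python
import json
from pathlib import Path

import numpy as np


def load_sample(directory, i):
    """Load the EMG array and info JSON for sample `i` (hypothetical helper).

    The audio and button files are returned as paths only, not decoded.
    """
    directory = Path(directory)

    # Raw EMG signals, expected shape (T, 8).
    emg = np.load(directory / f"{i}_emg.npy")

    # Extra information, such as the text prompt that was read.
    with open(directory / f"{i}_info.json", encoding="utf-8") as f:
        info = json.load(f)

    return {
        "emg": emg,
        "info": info,
        "audio_path": directory / f"{i}_audio.flac",
        "audio_clean_path": directory / f"{i}_audio_clean.flac",
        "button_path": directory / f"{i}_button.npy",
    }
```

Calling `load_sample("some_directory", 0)` would read `0_emg.npy` and `0_info.json` from that directory; the directory name here is only a placeholder.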

Note that some samples do not represent actual datapoints, but are used as reference EMG or audio signals.  These samples are marked with "sentence_index: -1" in the associated info file.
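
One possible way to skip the reference samples is to scan the info files and keep only those whose `sentence_index` is not -1. This is a minimal sketch: the helper names are hypothetical, and it assumes `sentence_index` is stored as a JSON integer at the top level of each info file.

```python
import json
from pathlib import Path

INFO_SUFFIX = "_info.json"


def is_reference_sample(info):
    """True if the info dict marks a reference sample (assumes integer -1)."""
    return info.get("sentence_index") == -1


def list_datapoint_ids(directory):
    """Return the ids of samples in `directory` that are actual datapoints."""
    ids = []
    for info_path in Path(directory).glob("*" + INFO_SUFFIX):
        with open(info_path, encoding="utf-8") as f:
            info = json.load(f)
        if not is_reference_sample(info):
            # Ids are kept as strings: strip the "_info.json" suffix.
            ids.append(info_path.name[: -len(INFO_SUFFIX)])
    return sorted(ids)
```

The ids are sorted as strings, so convert them to integers first if numeric order matters.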
