Convert Data to RecordIO Files
- Download the Kaggle Display Advertising Challenge Dataset from the link
- Untar "dac.tar.gz" to get "train.txt"
- Run the following command, replacing xxx with the directory that contains the extracted dac folder:
python convert_to_recordio.py \
--records_per_shard 400000 \
--output_dir ./dac_records \
--data_path xxx/dac/train.txt
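
To make the effect of --records_per_shard more concrete, here is a minimal Python sketch of how a text file can be split into fixed-size shards. It is not the code of convert_to_recordio.py, whose internals are not shown here; the names shard_records and parse_line are hypothetical and exist only for illustration. The sketch assumes the usual layout of train.txt: one tab-separated record per line with a label, 13 integer fields, and 26 categorical fields, where missing values are empty strings.

```python
import itertools
from typing import Iterable, Iterator, List, Optional, Tuple


def shard_records(lines: Iterable[str], records_per_shard: int) -> Iterator[List[str]]:
    """Yield consecutive lists holding at most `records_per_shard` lines each.

    Hypothetical helper: it only illustrates the sharding idea and does not
    write RecordIO files.
    """
    if records_per_shard <= 0:
        raise ValueError("records_per_shard must be positive")
    iterator = iter(lines)
    while True:
        shard = list(itertools.islice(iterator, records_per_shard))
        if not shard:
            return
        yield shard


def parse_line(line: str) -> Tuple[int, List[Optional[int]], List[str]]:
    """Split one line into (label, 13 integer fields, 26 categorical fields).

    Assumes the tab-separated layout described above; an empty integer
    field is returned as None, an empty categorical field as "".
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 40:
        raise ValueError(f"expected 40 fields, got {len(fields)}")
    label = int(fields[0])
    dense = [int(value) if value else None for value in fields[1:14]]
    categorical = fields[14:40]
    return label, dense, categorical
```

With --records_per_shard 400000, each shard in this sketch would hold 400000 lines, and the final shard would hold whatever remains. Passing an open file object as lines keeps memory use bounded to one shard at a time.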