Hello all,
I am reading a lot of CSV files on an HPC. I have a for
loop which reads csv files, and every time it reads a file, I get an output like this:
indexing 00017942.csv [=====================================] 8.00GB/s, eta
indexing 00025702.csv [=====================================] 7.50GB/s, eta
indexing 00041004.csv [=====================================] 4.49GB/s, eta
indexing 00031140.csv [=====================================] 8.78GB/s, eta
What is this indexing? It seems like it's taking longer than it should to read files.
Any help is appreciated. Thank you.