Import large .par file
I need to import a .par file that is 2.5 GB in size. The file is a numerical table with 10 columns and a few million rows.
I tried to use the parquetread function, but I got an error message ("Unable to read Parquet"). A reduced 500 MB version of the Parquet file loads successfully, so my conclusion is that the original .par file is too large.
Any suggestions on how to import a large .par file?
Answers (1)
Saffan
2023-5-29
Hi Carlo,
One way to solve this issue is to split the original Parquet file into multiple smaller files and create a “parquetDatastore” from them. The data can then be obtained from the datastore using the “read” function. A datastore is a repository for a collection of files in which each individual file fits in memory but the entire collection may not. Datastores let you work with the data incrementally, which reduces memory usage.
You can refer to the following link for more information on datastores: https://www.mathworks.com/help/releases/R2022b/matlab/ref/datastore.html
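As a rough illustration of the datastore approach, here is a minimal sketch. It assumes the large file has already been split into smaller Parquet files that share the same columns. The folder name, the file names part1.parquet and part2.parquet, and the random sample data are hypothetical stand-ins created only so that the example runs on its own; in practice you would point “parquetDatastore” at the folder holding your own split files.

```matlab
% Hypothetical setup: write two small Parquet files to a fresh temporary
% folder to stand in for the pieces of the split 2.5 GB file.
partsFolder = tempname;
mkdir(partsFolder);
parquetwrite(fullfile(partsFolder, "part1.parquet"), array2table(rand(100, 10)));
parquetwrite(fullfile(partsFolder, "part2.parquet"), array2table(rand(50, 10)));

% Create a datastore over every Parquet file in the folder.
ds = parquetDatastore(partsFolder);

% Read the data one chunk at a time, so only one chunk is in memory at once.
totalRows = 0;
while hasdata(ds)
    chunk = read(ds);                       % table holding the next chunk
    totalRows = totalRows + height(chunk);  % replace with your own processing
end
disp(totalRows)
```

The loop body is where the per-chunk work would go, for example accumulating sums or keeping only the rows you need, so that the full table never has to be held in memory.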