Support for CSV and similar formats:
Issues in the testing phase:
I have encountered problems and inconsistencies when processing CSV files. While some formats such as XLSX and structured text files (TXT/CSV) are processed without problems, I have encountered difficulties when processing large files (> 2 GB), some delimiters are not properly recognized, and file rendering anomalies are observed.
- Can you elaborate on specific issues and inconsistencies encountered during the testing phase when processing CSV files?
- Are there any limits on the size of CSV files that Aleph can process?
- Which delimiters are not recognized properly, and what is the expected behavior?
- Can you provide more details about the observed file rendering anomalies?
Specific errors:
Separators are not recognized:
- The following delimiters are not properly handled: “|”, " ", “±”.
- Problem with email address field: In a file with 3 fields (separated by “±” characters), the email address field was separated by “@” characters.
- Failure when processing large files: When processing files larger than 200 MB, strange behavior was observed: the process stopped at a certain iteration and the file was not displayed in the client.