I discovered a crazy bug today trying to import data from an Excel file into a Laravel app.
Modern Excel files are actually zip archives containing several XML files in a specific directory structure. Inside the zip, though, the "directory structure" isn't real: each XML file just has a path-like prefix as part of its file name.
In the particular Excel file I'm importing, the XML file that knows about the different worksheets in the workbook was saved with backslashes instead of "/" characters in its path. That was very unexpected, and none of the Excel packages I tried for PHP could read it.
In the end I used PHP's built-in ZipArchive class to rename the XML file inside the Excel archive, and after that it could be imported properly.
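A minimal sketch of the same idea, written in Python rather than PHP. It assumes the only problem is backslashes in the entry names. Python's standard `zipfile` module has no in-place rename, so this copies every entry into a new archive under a corrected name instead of renaming a single entry as described above. The function name `normalize_entry_names` is made up for illustration.

```python
import io
import zipfile


def normalize_entry_names(src_bytes: bytes) -> bytes:
    """Return a copy of a zip archive whose entry names use '/' instead of backslashes."""
    out = io.BytesIO()
    with zipfile.ZipFile(io.BytesIO(src_bytes)) as src, \
            zipfile.ZipFile(out, "w", zipfile.ZIP_DEFLATED) as dst:
        for info in src.infolist():
            # orig_filename is the name exactly as stored in the archive.
            fixed_name = info.orig_filename.replace("\\", "/")
            # Read via the ZipInfo object so the lookup does not depend on the name.
            dst.writestr(fixed_name, src.read(info))
    return out.getvalue()
```

Reading the uploaded file's bytes, passing them through this function, and handing the result to the spreadsheet importer would be one way to apply it. Note that the copy resets entry timestamps, which should not matter for an import.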
That wasn't how I expected to spend my afternoon!
#UntrustedData #PHP #DataIngestion #WeirdBugs #DeveloperLife
#DataIntegration vs. #DataIngestion: What are the differences?
#mst #digitaltransformation #datamanagement #data #dataingestion #dataintegration
Information Specialist (m/f/d), TV-L E 13, full-time, fixed term of 2 years, Leibniz-Institut für Psychologie (ZPID) | Application deadline: 01.10.2022 https://leibniz-psychology.onlyfy.jobs/job/za5hnrmo | https://jobs.openbiblio.eu/stellenangebote/80589 #openbibliojobs #dataingestion #Repositorium #OpenAccess #DSpace