Skip to content

Excel file data source

This type of data source imports tabular data from a Microsoft Excel spreadsheet. Please refer to the generic sources edition documentation for standard elements, like the preloading actions, adapters, filter, modelling etc.

Parsers

CogTL allows to load Excel file using two different modes:

Standard parser (by default)

Based on Apache POI, offers the possibility to load small and medium-sized spreadsheets (depending on the quantity of memory available on the server or agent performing the loading operation), supports more complex spreadsheets, with multiple different cell formats, and older XLS files (Excel 97-2003). However, you shouldn't use it on large Excel files (several thousands of rows), as it needs a lot of memory and may reach to memory saturation of your server or agent.

Streaming parser

This parser only works with XLSX files (open document format), but has a small memory footprint and is able to read much larger Excel files. However, some more complex cells formats may not be correctly interpreted, so double-check the imported data.