Input formats

Describes the file format of source documents.

Format features

The table below presents the format that are managed and for each format the available features.

Microsoft Office formats

Format Requirement for reading Extensions Track Change Supported Page Number Supported Heading Number Supported Table cell recognition supported
Microsoft WORD 2007 .docx .docm NO NO YES YES
Microsoft WORD 97 MS Word must be installed .doc YES NO YES NO
Microsoft EXCEL 2007 .xlsx .xlsm N/A N/A NO YES, each cell is read as a new line.
Microsoft POWERPOINT 2007 .pptx N/A YES (slide) NO NO
Microsoft EXCEL 97 MS Excel must be installed .xls N/A N/A NO YES, each cell is read as a new line.

Comments in source Code

Format Requirement for reading Extensions Track Change Supported Page Number Supported Heading Number Supported Table cell recognition supported
C/C++ source code .cpp .hpp .c .h .inc NO N/A NO NO
Java source code .java NO N/A NO NO
JavaScript source code .js NO N/A NO NO
C# source code .cs NO N/A NO NO
XML file .xml NO N/A NO NO
Python source file .py NO N/A NO NO
Python for Windows script .pyw NO N/A NO NO
Windows batch script .bat NO N/A NO NO
Linux shell script .sh NO N/A NO NO

Other format

Format Requirement for reading Extensions Track Change Supported Page Number Supported Heading Number Supported Table cell recognition supported
Adobe PDF .pdf N/A YES YES (1) YES (1)
OpenOffice WRITER OpenOffice/LibreOffice WRITER must be installed .odt NO YES NO NO
REQCHECKER XLSX reports .xlsx NO NO NO NO

Warning

(1) Only some PDF heading number patterns are supported. The experimental heuristic algorithm automatically detects headings. The table of content is ignored. Standard patterns like "1.1 1.2.1.." and "1. A. 1. B.." are supported. Only some PDF structures are supported. The table cell recognition for PDF is an experimental feature.