Input formats

Describes the file formats supported for source documents.

Format features

The tables below list the supported formats and, for each format, the available features.

Microsoft Office formats

| Format | Requirement for reading | Extensions | Track Change Supported | Page Number Supported | Heading Number Supported | Table cell recognition supported | Style name filter |
|---|---|---|---|---|---|---|---|
| Microsoft WORD >2007 | | .docx .docm | NO | NO | YES | YES | YES |
| Microsoft WORD 97 | MS Word must be installed | .doc | YES | NO | YES | NO | NO |
| Microsoft EXCEL 2007 | | .xlsx .xlsm | N/A | N/A | NO | YES, each cell is read as a new line. | NO |
| Microsoft POWERPOINT 2007 | | .pptx | N/A | YES (slide) | NO | NO | NO |
| Microsoft EXCEL 97 | MS Excel must be installed | .xls | N/A | N/A | NO | YES, each cell is read as a new line. | NO |

Comments in source code

| Format | Requirement for reading | Extensions | Track Change Supported | Page Number Supported | Heading Number Supported | Table cell recognition supported |
|---|---|---|---|---|---|---|
| C/C++ source code | | .cpp .hpp .c .h .inc | NO | N/A | NO | NO |
| Java source code | | .java | NO | N/A | NO | NO |
| JavaScript source code | | .js | NO | N/A | NO | NO |
| C# source code | | .cs | NO | N/A | NO | NO |
| XML file | | .xml | NO | N/A | NO | NO |
| Python source file | | .py | NO | N/A | NO | NO |
| Python for Windows script | | .pyw | NO | N/A | NO | NO |
| Windows batch script | | .bat | NO | N/A | NO | NO |
| Linux shell script | | .sh | NO | N/A | NO | NO |

Other formats

| Format | Requirement for reading | Extensions | Track Change Supported | Page Number Supported | Heading Number Supported | Table cell recognition supported | Style name filter |
|---|---|---|---|---|---|---|---|
| Adobe PDF | | .pdf | N/A | YES | YES (1) | YES (1) | NO |
| OpenOffice WRITER | OpenOffice WRITER must be installed | .odt | NO | YES | YES | NO | YES |
| REQCHECKER XLSX reports | | .xlsx | NO | NO | NO | NO | NO |
| Markdown (basic support) | | .md | NO | NO | NO | NO | NO |
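
The extensions and installation requirements listed in the tables above can be used to pre-check a file before it is submitted for reading. The following is only a minimal sketch: the names `SUPPORTED_EXTENSIONS`, `REQUIRED_APPLICATION` and `check_input` are hypothetical and are not part of the tool, and the case-insensitive extension matching is an assumption.

```python
from pathlib import Path

# Extensions transcribed from the tables above.
# The set and dictionary names are hypothetical, chosen for this illustration.
SUPPORTED_EXTENSIONS = {
    ".docx", ".docm", ".doc", ".xlsx", ".xlsm", ".pptx", ".xls",
    ".cpp", ".hpp", ".c", ".h", ".inc", ".java", ".js", ".cs", ".xml",
    ".py", ".pyw", ".bat", ".sh",
    ".pdf", ".odt", ".md",
}

# Formats whose "Requirement for reading" column names an application.
REQUIRED_APPLICATION = {
    ".doc": "MS Word",
    ".xls": "MS Excel",
    ".odt": "OpenOffice WRITER",
}


def check_input(path):
    """Return (supported, required_application_or_None) for a file path."""
    # Assumption: extensions are compared case-insensitively.
    ext = Path(path).suffix.lower()
    return ext in SUPPORTED_EXTENSIONS, REQUIRED_APPLICATION.get(ext)
```

For example, `check_input("spec.doc")` reports that the file is supported and that MS Word must be installed, while `check_input("notes.txt")` reports an unsupported extension. Note that `.xlsx` appears twice in the tables (Microsoft EXCEL 2007 and REQCHECKER XLSX reports), so the extension alone does not tell which of the two formats applies.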

Warning

(1) PDF headings are detected automatically by an experimental heuristic algorithm, and only some heading number patterns are supported. Standard patterns such as "1.1 1.2.1.." and "1. A. 1. B.." are supported. The table of contents is ignored. Only some PDF structures are supported, and table cell recognition for PDF is also an experimental feature.
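
To give an idea of what matching such heading number patterns can look like, here is a minimal sketch. It is not the tool's actual heuristic, which is not documented here. The regular expressions, the function name `parse_heading`, and the reading of "1. A. 1. B.." as numeric and single-letter numbering levels are all assumptions made for illustration.

```python
import re

# Hypothetical patterns, for illustration only; not the tool's real algorithm.
# Dotted numeric numbering, e.g. "1.1 Scope", "1.2.1 Details", "3. Overview".
NUMERIC = re.compile(r"^(\d+(?:\.\d+)*)\.?\s+(\S.*)$")
# Single capital letter followed by a dot, e.g. "A. Context".
ALPHA = re.compile(r"^([A-Z])\.\s+(\S.*)$")


def parse_heading(line):
    """Return (number, title) if the line looks like a numbered heading, else None."""
    text = line.strip()
    for pattern in (NUMERIC, ALPHA):
        match = pattern.match(text)
        if match:
            return match.group(1), match.group(2)
    return None
```

With these patterns, `parse_heading("1.2.1 Details")` yields the number `1.2.1` and the title `Details`, and a line without leading numbering yields `None`. A pattern match alone also accepts ordinary sentences that happen to start with a number, so checking that headings in a PDF follow an expected numbering before relying on heading numbers is a reasonable precaution.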