Compliance Readiness Limitations
Please find below the limitations for docx/PDF files uploaded in Compliance Readiness:
Doc Parser limitations:
Documents that do not have consistent fonts, styles through the document will not be correctly parsed
The title should have the max font size in the document else it might not be identified
Section headings should have the same font style to correctly split sections
Does not extract images/diagrams
DOCX
Does not extract URLs/markup text
Does not extract header/footers
Does not parse tables with any merged rows/cols
List points not auto-numbered will be detected as paragraph text; not list point
Superscripts are not aligned
List points that do not appear in the text are missing (usually pdf generated by print option)
Tables without borders might not parse correctly
Single-cell/column/row table will be discarded
LIST POINTS
List points starting with a verb are merged with the previous sentence
List points supported:
Unordered
Disc
Ordered
Alpha
Lower-alpha
Decimal
Roman
Lower-roman
TABLES
Tables without headers might not make sense when displayed
Tables without merged cols/rows would be properly displayed
β