Describe the bug
When specifying output_format as csv, the response from the api is different when split_pdf_page is True or False. When False, the elements contain an extra metadata field: text_as_html. This also means the element id does not match.
To Reproduce
_test_unstructured_client/integration/test_decorators.py::test_integration_split_csv_response illustrates this, but is passing because it asserts on a shortened string.
Expected behavior
The response to be identical whether or not split_pdf_page is True or False.
Describe the bug
When specifying
output_formatascsv, the response from the api is different whensplit_pdf_pageisTrueorFalse. WhenFalse, the elements contain an extra metadata field:text_as_html. This also means the element id does not match.To Reproduce
_test_unstructured_client/integration/test_decorators.py::test_integration_split_csv_responseillustrates this, but is passing because it asserts on a shortened string.Expected behavior
The response to be identical whether or not
split_pdf_pageisTrueorFalse.