Document Parser SDK - remove watermark before parsing
If your PDF document contains a watermark and breaks the Document Parser result, you can use the globalTextFilters
option that can be helpful with this problem. These filters remove matching text objects from PDF documents before they get to the parsing engine. Add them to the options
section of the template.
For example, if your PDF contains the watermark Customer Copy
:
"options": {
"globalTextFilters": [
{
"expression": "Customer Copy"
}
]
}