Unlock the power of unstructured content with Document Filters

Hyland’s Document Filters is a powerful toolkit that helps organizations unlock the power of unstructured content through file identification, content extraction and format conversion.

Organizations across a wide range of industries, with use cases such as eDiscovery, legal compliance, artificial intelligence and machine learning, have found value in the capabilities delivered by Document Filters.

Let’s take a look at the exciting new components and features in Document Filters and how they can positively impact your 2023 goals.

7 new formats now ready for Document Filters

Included in our updates are seven new formats (listed below).

Delivering new formats is one of the main ways Document Filters consistently provides value, because we know our customers want to use our toolkit to process any file they could possibly receive. Being able to work with every file is especially important for our security and compliance customers, who don’t want any documents passing through their systems that they can’t analyze.

The new formats include:

  • Apple Pages 2009 files
  • Apple Keynote 2009 files
  • Self-extracting 7Zip EXE files
  • Self-extracting RAR EXE files
  • Self-extracting ZIP EXE files
  • Adobe Photoshop Large Document Files
  • AutoCAD 2018+
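
To give a rough sense of what identifying one of these formats involves, here is a minimal Python sketch that flags a possible self-extracting ZIP EXE. It is not the Document Filters API and not how the toolkit identifies files. The function name `looks_like_self_extracting_zip` is hypothetical, and the heuristic (a Windows executable header followed by a ZIP archive that the standard library can locate) is only an assumption about one simple way to spot such files.

```python
import io
import zipfile


def looks_like_self_extracting_zip(data: bytes) -> bool:
    """Heuristic check: an executable stub with a ZIP archive appended.

    Hypothetical helper for illustration only, using the standard library.
    """
    # Windows executables (DOS/PE) begin with the two-byte "MZ" magic.
    if not data.startswith(b"MZ"):
        return False
    # zipfile looks for the ZIP end-of-central-directory record near the
    # end of the data, so an executable stub in front does not hide it.
    return zipfile.is_zipfile(io.BytesIO(data))


def build_fake_sfx() -> bytes:
    """Build test data: a dummy 'MZ' stub followed by a small ZIP archive."""
    archive = io.BytesIO()
    with zipfile.ZipFile(archive, "w") as zf:
        zf.writestr("readme.txt", "hello")
    # The stub is not a real program; it only carries the magic bytes.
    return b"MZ" + b"\x00" * 64 + archive.getvalue()


if __name__ == "__main__":
    print(looks_like_self_extracting_zip(build_fake_sfx()))
```

A check like this only covers one format and can be fooled by unusual files, which is part of why a dedicated identification toolkit is useful when every incoming file has to be analyzed.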

5 other major Document Filters enhancements

2022’s four Document Filters releases included 35 new features and 141 security and quality updates, so customers will find plenty of improvements to take advantage of!

1. Document Filters is now available on NuGet

To get started, we want to mention a new delivery method introduced with the latest release, 22.4.

Document Filters is now available on NuGet! This means all the features we’re talking about here, and those we continue to deliver in the future, can be downloaded and implemented much more easily in environments that support downloading from NuGet. The NuGet packages can be found in the Hyland Software profile provided below.

Additionally, if you’re not a current Document Filters customer but are interested in trying it out, you can still download from NuGet. You will need a trial license to get started, and this can be requested at the link below.

Helpful content:

2. Support for Apple M1 architecture

We know our customers want to run Document Filters anywhere. It’s important we meet your needs and don’t dictate where your application can run. That’s why we now run natively on the Apple M1 architecture. This brings the total number of supported platforms and architectures to 29!

Learn more in the following video, starting at the 8:27 mark.

Document Filters 22.1.0 Release Demo

3. HD comment and annotation extraction

Document Filters can now extract text, coordinate information and other details from comments and annotations in Word, Excel and PowerPoint. We always want to provide the most information possible about any document. Comments and annotations can have special meaning, and that meaning can change based on where they’re placed or on other information they contain. You can use this new capability to output them directly into converted formats, create summary pages or even change them during conversion in conjunction with our robust annotation API.

See some code examples of how we’re able to utilize this to create summary pages and output the annotations to PDF in the 22.1 video embedded above, starting at the 1:34 mark.

4. Native Python bindings

Again, in keeping with the theme of running anywhere our customers may need us, we have updated our Python bindings. The previous bindings were tied to specific versions of Python, which made things difficult for some customers as new Python versions were released. The new bindings are version-agnostic and will work with any version going forward.

See some exciting ways our architect was able to utilize the bindings with machine learning code in the following video, starting at the 5:32 mark.

Document Filters 22.2.0 Release Demo

5. Enhanced PDF redaction support

This is an exciting new feature that allows an area of a PDF to be redacted while the rest of the page remains text. Previously, the only secure way to redact a PDF was to render the entire page as an image before redacting the area, which made the PDF much less valuable for the end user.

Hear more about how this works and some of the pitfalls you might find with other toolkits in this video, starting at the 0:51 mark.

Launch your document processing journey with Document Filters

Document Filters makes it easier, faster and more efficient to work with structured and unstructured content. Watch 2022’s most recent release videos below.

Learn more about Document Filters.

Document Filters 22.3.0 Release

Document Filters 22.4.0 Release Demo

About Hyland

Hyland is a privately held company and a leading content services provider. We enable thousands of organizations to focus on what they do best and deliver better experiences to the people they serve.

The power of connection drives Hyland. From connecting technology systems and data to connecting co-workers, teams and global communities, Hyland believes in transforming digital interactions into meaningful outcomes for customers, partners and our own employees.
