.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
-
Updated
Nov 6, 2024 - C#
.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
Case study using dotfurther's Open Discover Platform with the RavenDB document store to rapidly create a full-text search/eDiscovery/information governance capable demonstration application.
Identify a file via MIME type and file signature detection.
A script that downloads the NSRL RDS Modern and feeds the SHA-1 as key to a redis server
A simple CLI Tool scripted in Python to check for File types based on MIME types and then comparing them with the extensions.
PrintTracker is used to identify file formats on files that have lost their extension, it can learn to identify new file extensions by using the learn (-l) and print (-p) commands.
Add a description, image, and links to the file-identification topic page so that developers can more easily learn about it.
To associate your repository with the file-identification topic, visit your repo's landing page and select "manage topics."