A Python library that identifies a file's type from its file signature
pip install filext
If the category of the file is not known, you may use whatfile,
but it is a little slower than the category-specific functions.
from filext import whatfile
file_path = "./tests/files/document.pdf"
# pass file as path str
file_type = whatfile(file_path)
with open(file_path, "rb") as file:
    # pass file as bytes
    file_type = whatfile(file.read())
If the category of the file is known, you may use the function for that category instead.
from filext import whatdoc
file_path = "./tests/files/document.pdf"
# pass file as path str
file_type = whatdoc(file_path)
with open(file_path, "rb") as file:
    # pass file as bytes
    file_type = whatdoc(file.read())
- DOC
- PPT
- XLS
- DOCX
- PPTX
- XLSX
- BMP
- GIF
- HEIC
- JPG
- PNG
- TIF
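
For readers unfamiliar with file signatures, the general idea can be sketched as follows. This is a minimal illustration and not filext's implementation: the `SIGNATURES` table and the `sniff_signature` helper are hypothetical names introduced here, and the byte sequences are the widely documented leading bytes for each format. HEIC is omitted because its marker does not sit at the very start of the file.

```python
from typing import Optional

# Well-known leading byte sequences ("magic numbers").
# Hypothetical table for illustration; not taken from filext.
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"\xff\xd8\xff", "JPG"),
    (b"BM", "BMP"),
    (b"II*\x00", "TIF"),  # little-endian TIFF
    (b"MM\x00*", "TIF"),  # big-endian TIFF
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "DOC/PPT/XLS (OLE2 container)"),
    (b"PK\x03\x04", "DOCX/PPTX/XLSX (ZIP container)"),
]


def sniff_signature(data: bytes) -> Optional[str]:
    """Return a label for the first signature that prefixes `data`, else None."""
    for magic, label in SIGNATURES:
        if data.startswith(magic):
            return label
    return None


print(sniff_signature(b"%PDF-1.7\n"))
print(sniff_signature(b"plain text"))
```

The legacy Office formats share one container signature, and so do the newer Office formats, so a prefix check alone cannot tell DOCX from XLSX; separating them requires inspecting the container's contents.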