Skip to content

Michael-Sebero/Document-Tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 

Repository files navigation

How to Install

pip install pyzipper textract python-magic PyPDF2 python-docx chardet python-pytesseract requests beautifulsoup4 cssutils

git clone https://github.com/Michael-Sebero/Document-Tools

python3 /home/$USER/Document-Tools/document-tools.py

Compare Documents

This compares two documents and lists the similarities and differences to an output file.

Duplicate Line Remover

This detects duplicate lines in a file, removes them and then saves the changes to an output file.

Extract Text

This extracts text from an image or a directory full of images.

Find Word

This looks in a given directory recursively for keywords in documents and tells you where you can find them.

Keyword Line Extractor

This looks for keywords in a file and extracts lines where they're found to an output file.

Replace Keyword

This replaces keywords in a file.

Web Font Extractor

This fetches a webpage and downloads all fonts found in its stylesheets to ~/Downloads/Fonts/. Supports TTF only or all formats (TTF, OTF, WOFF, WOFF2, EOT).

Contributors

Languages