read ms word with python
import textract text = textract.process("path/to/file.extension")
Here is what the above code is Doing:
1. Importing the textract module
2. Running the textract.process method on your file.
The textract.process method will return the text of the file as a string.