These are some scripts that can be used to convert pdf locally and speed up the formatting of the words when copying and pasting from PDF.
- Click on the green
Code
button - Ensure that you are in the clone tab
- Click on
Download ZIP
- Extract the folder to the directory you want
- Open Window Powershell
- Use the cd command to go to the directory of the script.
e.g. if your file is in the Document folder, run
cd C:\User\name\Documents\automation_scripts
- run
.\<nameOfScript>.ps1
- Perform the task
- Hit enter
- Press Control + C to terminate the program except for pdfToWord
This removes the multiple spaces when the text is justified
This will remove trailing spaces like the previous function and split by delimiter.
This is useful when there are bullet points that come out with messed-up formatting. This will split them point by point and each point will be in a new line.
- Note: For some of the delimiters using bullets, they cannot be read by the compiler, thus you will need to find the Unicode for the delimiter.
e.g. $delimiter = [char]::ConvertFromUtf32(8226) # Unicode code point of
โข
you will need to google the Unicode for the specific delimiter you are using, then replace the code in the file.
Opens a prompt to load a pdf and convert it to word document.