So for reasons I cannot explain, I've wound up with paper copies of some VBA code that I want to use.
I've gone through a scanning process and corresponding OCR, (all on a Linux box, BTW) which works pretty well....
but, and there is always a but;
Some of the text is obviously OCR'ed in a non-standard text encoding.
I've imported it into the VBA IDE and the odd line simply will not run.
If I type the exact same line, character by character, it will run, thus the encoding conclusion.
Any thoughts as to a process to run through to ensure the text encoding is the one that the VBA IDE wants to see?
I tried changing the encoding to ANSI in Notepad++ but that had no effect.
Thanks for any thoughts.
Oddball text question
Oddball text question
chris
Re: Oddball text question
I have seen issues with the double quotes using similar process, so I type them and then all is good. So, check if this is the case with your line.
Deepak Gupta
SOLIDWORKS Consultant/Blogger
SOLIDWORKS Consultant/Blogger
-
- Posts: 82
- Joined: Thu Jan 20, 2022 3:35 pm
- x 31
- x 91
Re: Oddball text question
I expect @gupta9665 is on to something with the quotations.
Be careful on ANSI (8 bits) v ASCII (7 bits). They overlap on the first 128 characters, but ANSI has another 100 or so odd-balls. These include a few variations on quotation marks and other stuff that isn't on a typical keyboard. I could easily see your scanner picking something obscure. I'd try again with ASCII encoding and see if that works.
Be careful on ANSI (8 bits) v ASCII (7 bits). They overlap on the first 128 characters, but ANSI has another 100 or so odd-balls. These include a few variations on quotation marks and other stuff that isn't on a typical keyboard. I could easily see your scanner picking something obscure. I'd try again with ASCII encoding and see if that works.
Re: Oddball text question
What OCR software are you using on the Linux machine?
Also, can you upload one of the misbehaving files?
Also, can you upload one of the misbehaving files?
Re: Oddball text question
Bingoryan-feeley wrote: ↑Sun Feb 18, 2024 1:13 pm I expect @gupta9665 is on to something with the quotations.
Be careful on ANSI (8 bits) v ASCII (7 bits). They overlap on the first 128 characters, but ANSI has another 100 or so odd-balls. These include a few variations on quotation marks and other stuff that isn't on a typical keyboard. I could easily see your scanner picking something obscure. I'd try again with ASCII encoding and see if that works.
I'm using OCRFeeder - https://wiki.gnome.org/Apps/OCRFeeder
Switching to ANSI via Notepad++ converted some of the quotation marks into weird characters.
Thank you!
chris