RecognizeToFile output not as expected

May 1, 2011 at 5:34 AM

Hello,

I am using the following sample code:

    // Recognize one page image and write the result as plain ASCII text.
    using (var pumaPage = new PumaPage(file))
    {
        pumaPage.FileFormat = PumaFileFormat.TxtAscii;
        pumaPage.EnableSpeller = false;
        pumaPage.Language = PumaLanguage.English;
        pumaPage.RecognizeToFile(@"c:\temp\page001.txt");
    }

 

on a picture I downloaded from here:

http://www.google.ca/imgres?imgurl=http://jurnsearch.files.wordpress.com/2009/07/ocr-test.jpg&imgrefurl=http://www.sciweavers.org/i2ocr&h=728&w=567&sz=130&tbnid=GAW8zZEJsLqEeM:&tbnh=254&tbnw=198&prev=/search%3Fq%3Docr%2Btest%2Bimages%26tbm%3Disch%26tbo%3Du&zoom=1&q=ocr+test+images&usg=__rh9ayeWPxNVkjrrJ8rI6NICq5ck=&sa=X&ei=0vC8TZrkGIS4twfU29nHBQ&ved=0CCgQ9QEwAA

and I get this in my output file:

Tnt. laiilu umn f tli it'oika f,f inn ffcua, it i~ tf1n t I, iiill ctteu 1 tu h'n< t ufunti It ii i. Ihouhftt.il i al1 to rnmmcnro thr in i ith ini fii tinf' uf,ic il 5 rmntiut i ~ h tl.mi. ni thc nuil. uf n ateit.
iuifiutt.tttcc. The niit i lumo iiill tlni omhiin tliv'tfurif iut f Ic urtfi huuh i hicli iuutiunc th lfi turl toic:ir l. I'I:it iilnili 1iri 1 ini Inctun>l Lihuurcuniifercl t tvrnin,iti: lint thc Iciftli Iiuuk,
f' i'i iiii',i . i' fii 'I tn tlii' 1Ii tui'v, aiii I f iii li Ii ' I iiii loi
nai i m iittt, iill;il 1i incliuiml. Ih I,ittiraai 1 tli-i II'uiruu. Iiriun aili Ii airnn i I m tlui
t liiiiici. ii iii'iii'lr ii 1 ii ibii.' iii i'lii' ii\ 1 Ii'<'i' . iai('li 1 i'I, ii I iii< ii fi liirc I I I ii i1;ii,it

. v lirtm tlc I ~ in rtfI ur lrmti I <lm ~ fr uiri:v iia c 1i ri tai,in

.", n, 1 ih:in lc i.iliircI rint a l,iti I thi Auth rII 1> hi n irvfiicI t tln> Innir '1'ftc Lifi. II i)a;ll'('iiit i inni iii,<rh uniicr.iiili
iiiii 'I I I I 1i,il iiliii.:i t iiif i i' I '
im

 

The output is unreadable. Am I missing a setting?