I found that there is a JpegDecoder in the Atalasoft software. To convert the images, you need a function similar to the PDF converter.

Philo,

Hi, I'm the support engineer you spoke with yesterday. I apologize: after you called in, I received a note from our chief software architect asking us to help you out.

Atalasoft DotImage Document Imaging is an SDK that offers high-speed document and image conversion, viewing and annotation on any device.

Author: Grokora Gugrel
Country: Equatorial Guinea
Language: English (Spanish)
Genre: Health and Food
Published (Last): 1 October 2010
Pages: 377
PDF File Size: 4.55 Mb
ePub File Size: 9.28 Mb
ISBN: 403-2-21676-157-8

A couple of things come to mind in this case. It might be because the LZW-compressed image is not bitonal, and the above code doesn't handle anything but 1bpp.

Anytime I try to convert a JPEG to TIFF, an issue arises because the image is an AtalaImage and not a System.
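The bit-depth point can be illustrated with a small, library-neutral sketch. It relies on a general property of TIFF: the CCITT Group 3 and Group 4 schemes only encode 1bpp (bitonal) data, while LZW, Deflate and PackBits accept any pixel depth. The function and constant names below are hypothetical and are not part of the Atalasoft API.

```python
# Hypothetical helper (not an Atalasoft API): list the TIFF compression
# schemes that can represent an image of a given pixel depth.
BILEVEL_ONLY = ("CCITT Group 3", "CCITT Group 4")   # 1bpp images only
ANY_DEPTH = ("LZW", "Deflate", "PackBits", "None")  # any pixel depth


def allowed_tiff_compressions(bits_per_pixel: int) -> tuple:
    """Return the compression names usable at this pixel depth."""
    if bits_per_pixel < 1:
        raise ValueError("bits_per_pixel must be positive")
    if bits_per_pixel == 1:
        return BILEVEL_ONLY + ANY_DEPTH
    return ANY_DEPTH


def is_compatible(compression: str, bits_per_pixel: int) -> bool:
    """True if the compression scheme can encode this pixel depth."""
    return compression in allowed_tiff_compressions(bits_per_pixel)
```

Code that only handles 1bpp input would therefore need either a conversion to bitonal first, or a different compression choice for color and grayscale pages.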

c# – using AtalaSoft to convert Tiff compression – Stack Overflow

Also, can you define a region to search for text by giving x and y coordinates? Will this work to perform OCR on images which are not documents, but contain text?

When opening the PDF in Acrobat (see screenshot below), all text in the document can be selected as real text, even though the visible part of this PDF is the actual color rasterized image.

Unable to write to a convert file.

Then I want to remove the text, so all I have left is the images that were on the pages.

As you can tell, this is not very memory efficient for large documents. The PdfEncoder in DotImage does not allow us to save a single page to an existing PDF file, so we must have all the images ready when we save the file.

The Adobe Reader version is 8. See a recent post in this thread for more information.
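To make the memory cost concrete, here is a minimal, library-neutral sketch that counts how many decoded pages are resident at once under two strategies. Everything in it (PageTracker, save_eager, save_lazy) is hypothetical and stands in for real image loading and encoding. Whether a particular encoder can actually pull pages one at a time depends on the library.

```python
class PageTracker:
    """Counts how many decoded pages are held in memory at the same time."""

    def __init__(self):
        self.live = 0   # pages currently resident
        self.peak = 0   # highest number resident at once

    def load(self, index: int) -> str:
        self.live += 1
        self.peak = max(self.peak, self.live)
        return f"page-{index}"   # stand-in for a decoded image

    def release(self) -> None:
        self.live -= 1


def save_eager(tracker: PageTracker, page_count: int) -> int:
    """Load every page first, then write them all in one save call."""
    pages = [tracker.load(i) for i in range(page_count)]
    written = len(pages)          # stand-in for the single save
    for _ in pages:
        tracker.release()
    return written


def save_lazy(tracker: PageTracker, page_count: int) -> int:
    """Load, write and release one page at a time."""
    written = 0
    for i in range(page_count):
        tracker.load(i)
        written += 1              # stand-in for writing this page
        tracker.release()
    return written
```

With 50 pages, the eager version peaks at 50 resident pages and the lazy version at 1, which is the difference that matters for large documents.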


Atalasoft Knowledge Base

The TiffDocument takes a stream as a parameter, which is why they are stream functions.

What we want is a document format that looks like the original images when humans look at it, but that looks like plain text when the indexer looks at it.

Reading is slow compared to listening, I guess.

Using the Atalasoft free SDK, http:

I just want to locate the position of all the text, the boxes which contain all the text on the page.

As you can see from the following example, the first way is much easier to implement, though the second way will conserve a lot of memory.

The process has three steps:

1. Decompress the image.
2. Pre-process the image to make OCR more accurate, including cleaning it or deskewing it.
3. OCR the image to extract the text.
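The decompress, pre-process and OCR steps can be sketched as a chain of small functions. This is only an illustration of how the stages compose, under loud assumptions: the input is zlib-compressed 8-bit grayscale data, the pre-processing is a plain threshold rather than real cleanup or deskewing, and the OCR engine is a caller-supplied function. None of these names come from DotImage.

```python
import zlib


def decompress(raw: bytes, width: int) -> list:
    """Step 1: inflate zlib-compressed 8-bit grayscale pixels into rows."""
    pixels = zlib.decompress(raw)
    return [list(pixels[i:i + width]) for i in range(0, len(pixels), width)]


def binarize(rows: list, threshold: int = 128) -> list:
    """Step 2 (stand-in): threshold each pixel to black (0) or white (1)."""
    return [[1 if p >= threshold else 0 for p in row] for row in rows]


def run_pipeline(raw: bytes, width: int, ocr_engine):
    """Run all three steps; ocr_engine is a hypothetical callable (step 3)."""
    rows = decompress(raw, width)
    clean = binarize(rows)
    return ocr_engine(clean)
```

Passing the OCR engine in as a parameter keeps the sketch independent of any particular OCR library.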


I set the input file path to the location of the PDF file on my D: drive, and the output file path to a folder on the same drive. To do this we need to: Does it have to be a scanned document?

This technology already exists.

Days after posting this message I decided to try it in the lounge, and there I realized that it already exists, perhaps not like what is in my mind, but in another version.

This article is in the Product Showcase section for our sponsors at CodeProject.

Thanks, the x86 reference worked. The code below is the same as the code in the link: Save(outStream, img, null); img.

Converting Scanned Document Images to Searchable PDFs with OCR

Simply having this file on your filesystem will cause Google Desktop Search or Windows Desktop Search to index this document properly, with the document looking exactly like the original.

The answer is “because the file you opened did not contain data that any ImageDecoder in the RegisteredDecoders.
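The idea behind that answer, a list of registered decoders each checking whether it recognizes the file, can be sketched without reference to any particular SDK. The sketch below matches the leading bytes of a file against well-known format signatures. The registry layout and the names find_decoder and NoDecoderError are hypothetical, and this is not how DotImage is implemented internally.

```python
# Well-known leading byte signatures ("magic numbers") per format.
SIGNATURES = {
    "JPEG": (b"\xff\xd8\xff",),
    "PNG": (b"\x89PNG\r\n\x1a\n",),
    "TIFF": (b"II*\x00", b"MM\x00*"),   # little-endian and big-endian
    "PDF": (b"%PDF",),
}


class NoDecoderError(Exception):
    """Raised when no registered decoder recognizes the data."""


def find_decoder(header: bytes, registered: dict = SIGNATURES) -> str:
    """Return the name of the first registered decoder that matches."""
    for name, magics in registered.items():
        if any(header.startswith(magic) for magic in magics):
            return name
    raise NoDecoderError("no registered decoder recognizes this data")
```

If a format's decoder is missing from the registry, a perfectly valid file of that format fails in the same way as a corrupt one, so checking which decoders are registered is a reasonable first step.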
