Itext Pro 1 2 5 – Ocr Tool Software

broken image


Get Your Free OCR API Key

  1. Itext Pro 1 2 5 – Ocr Tool Software Download
  2. Itext Pro 1 2 5 – Ocr Tool Software Download
  3. Itext Pro 1 2 5 – Ocr Tool Software Free

Kerbal Space Program 1 1 2 Download Free Joyoshare Media Cutter 3 2 0 43 12 Integrity Plus 6 11 16 Download Free Roadblock 1 5 8 Ip Scanner Pro 3 44 Download Free Itext Pro 1 2 5 – Ocr Tool Installer Pdf Protector 1 4 13 Chatty For Facebook 1 7 Flume Pro 2 8 6 3 Download Free Spideroak 7 3 0. IMazing Crack Itext pro 1 2 5 – ocr tool key. Is a multimedia software to share a different type of file from one device to another. It is trusting able software that you want. Istumbler 103 3 download free. So, if you want to manage your data then iMazing crack is best for all that.

Register here for your free OCR API key. The OCR API provides a simple way of parsing images and multi-page PDF documents (PDF OCR) and getting the extracted text results returned in a JSON format. The OCR API has three tiers/levels. The free OCR API plan has a rate limit of 500 requests within one day per IP address to prevent accidental spamming.

For even faster response times and guaranteed 100% uptimePRO plans are available. The PRO OCR API runs on physically different servers than our free OCR API service. You receive the URLs for the global PRO endpoints and your API key in the welcome email directly after you have signed-up for the PRO or PRO PDF account. The PRO OCR API can also be purchased as a locally installable on-premise OCR software.

API PlanFreePROPRO PDFEnterprise
PricingFree $30/month $60/month $299+/month
Sign-Up
& Get API Key
Register for
free API key
Buy PRO
API Key
Buy PRO PDF
API Key
Contact Sales
Requests/month25,000300,000300,000Custom
Additional conversions*n/aUS$10/100,000US$20/100,000Included
File Size Limit1 MB5 MB100 MB+100 MB+
PDF Page Limit 33999+999+
Searchable PDF CreationYes (with watermark)YesYesYes
SpeedFastFaster (more servers, lower load)Fastest (Your own server)
Rate Limit**500 calls/DAY6000 calls/1h6000 calls/1hCustom
Service-level agreement (SLA)n/a100% uptime or money back
(dedicated, redundant servers in USA/EU/Asia)
Custom location(s)

*Additional conversions: We do not charge for extra conversions automatically. Instead, if you reach the limit, we will contact you and you can decide if you want to pay for additional conversions or stop for the current billing period.

**For the PRO plans, we can adjust the default rate limit if needed.

You can check the API performance and uptime at the API status page.
Now it's time to get started: Below you find example code for calling the API from Postman, AutoHotKey (AHK), cURL, C#, ASP.NET, Delphi, iOS, Java (Android app), Node.JS NPM, Python, C++/QT, Ruby, and Javascript. (If you have code examples for other languages, please let us know and we will add them to this list).

Top

Faster OCR with the PRO plans

For our OCR PRO plans we use redundant high-performance API endpoints in the US, EU and Asia regions. We guarantee 100% uptime or money back. Our hosted PRO OCR plans:

  • - Monthly plans, cancel anytime. To subscribe, please see the links in the table above. If the embedded checkout does not work in your browser, please use our classic web store for the PRO and PRO PDF plans.

  • - Yearly pre-paid plans with a 20% discount (Order PRO yearly and Order PRO PDF yearly)
  • - Custom OCR server (starting at US$ 299/month). We set up OCR servers just for your data at a location of your choice. These serves include unlimited conversions (limited only by hardware). Contact us for a quote.

In addition to connecting to our PRO OCR servers, you can also directly buy our OCR software and host it yourself. This option is described in the next paragraph below.

Top

OCR.space Local Self-hosted, On-Premise OCR Server

OCR.space Local - Enterprise Image and PDF OCR
OCR.space is powerful server-based OCR software for automated document capture and PDF conversion. With OCR.space Local you can install and host our popular OCR API and Searchable PDF creation software on your own PC and/or inside your OCR API Endpoint

Using an OCR API was never easier..

Besides the full-featured 'POST' OCR API at /parse/image we provide an additional OCR API endpoint at /parse/ImageUrl for GET requests. While not as versatile as the POST API, it is easy to use. Everything you need for the api call is inside the URL.

Example (just click the link to start the OCR):
https://api.ocr.space/parse/imageurl?apikey=helloworld&url=http://i.imgur.com/fwxooMv.png
The default OCR language is English. To use another language, add &language to the url. You can also request the x/y word coordinates with isOverlayRequired:
https://api.ocr.space/parse/imageurl?apikey=helloworld&url=http://i.imgur.com/s1JZUnd.gif&language=chs&isOverlayRequired=true

The important limitation of the GET api endpoint is it only allows image and PDF submissions via the URL method, as only HTTP POST requests can supply additional data to the server in the message body. GET requests include all required data in the URL. So by design, a GET api cannot support file uploads (file parameter) or BASE64 strings (base64image).

The GET API is easy and fast to use. Just note that the URL with the api key might be stored in your browser's history. But this is not a security problem because even if somebody gains access to your personal API key, he or she can not access any information about you or the OCR'ed documents, because we do not store such information in the first place. The worst-case scenario is that somebody uses all your free conversions. If this might be a problem for your application, simply continue to use the fully SSL-encrypted POST version of the API or switch to the PRO OCR API, which provides additional options.

Top

The table below lists all possible API parameters. As additional documentation, we published a sample api call collection that you can load into Postman. And last but not least: Our free online ocr form on the front page is nothing else than a POST call to the free OCR API endpoint and can be used for testing as well.

KeyValueDescription
apikeyAPI Key (send in the header)Get your free API key
url or file or base64Imageurl: URL of remote image file (Make sure it has the right content type)
file: Multipart encoded image file with filename
base64Image: Image or PDF as Base64 encoded string
You can use three methods to upload the input image or PDF. We recommend the URL method for file sizes > 10 MB for faster upload speeds.
language [Optional]
Arabic=ara
Bulgarian=bul
Chinese(Simplified)=chs
Chinese(Traditional)=cht
Croatian = hrv
Czech = cze
Danish = dan
Dutch = dut
English = eng
Finnish = fin
French = fre
German = ger
Greek = gre
Hungarian = hun
Korean = kor
Italian = ita
Japanese = jpn
Polish = pol
Portuguese = por
Russian = rus
Slovenian = slv
Spanish = spa
Swedish = swe
Turkish = tur
Language used for OCR. If no language is specified, English eng is taken as default.
IMPORTANT: The language code has always 3-letters (not 2). So it is 'eng' and not 'en'.
Engine2 has automatic Western language detection, so this value will be ignored.
isOverlayRequired [Optional] Boolean value Default = False
If true, returns the coordinates of the bounding boxes for each word. If false, the OCR'ed text is returned only as a text block (this makes the JSON reponse smaller). Overlay data can be used, for example, to show text over the image.
filetype [Optional] String value: PDF, GIF, PNG, JPG, TIF, BMP Overwrites the automatic file type detection based on content-type. Supported image file formats are png, jpg (jpeg), gif, tif (tiff) and bmp. For document ocr, the api supports the Adobe PDF format. Multi-page TIFF files are supported.
detectOrientation [Optional] true/false If set to true, the api autorotates the image correctly and sets the TextOrientation parameter in the JSON response. If the image is not rotated, then TextOrientation=0, otherwise it is the degree of the rotation, e. g. '270'.
isCreateSearchablePdf [Optional] Boolean value Default = False
If true, API generates a searchable PDF. This parameter automatically sets isOverlayRequired = true.
isSearchablePdfHideTextLayer [Optional] Boolean value Default = False. If true, the text layer is hidden (not visible)
scale [Optional] true/false If set to true, the api does some internal upscaling. This can improve the OCR result significantly, especially for low-resolution PDF scans. Note that the front page demo uses scale=true, but the API uses scale=false by default. See also this OCR forum post.
isTable [Optional] true/false If set to true, the OCR logic makes sure that the parsed text result is always returned line by line. This switch is recommended for table OCR, receipt OCR, invoice processing and all other type of input documents that have a table like structure.
OCREngine [Optional] 1 or 2 Engine 1 is default. See OCR Engines.

Tip: When serving images from an Amazon AWS S3 bucket, Google cloud storage or a similar services for use with the 'URL' parameter, make sure the file link has the right content type. It should not be 'Content-Type:application/x-www-form-urlencoded' (which seems to be the default for AWS) but image/png or similar for images. For PDF documents make sure the content type is not 'image/pdf' but application/pdf. You can check the content type of your links for example with this MIME content type checker (external service, not from us). The OCR API uses the content type to automatically detect the correct file. But if you have the wrong content-type and can not change it (e.g. because you do not control the cloud storage), no problem: In this case you can overwrite the automatic file type detection by adding the the filetype= parameter and tell the API directly what type of document you are sending (PNG, JPG, GIF, PDF).

New: If you need to detect the status of checkboxes, please contact us about the Optical Mark Recognition (OMR) (Beta) features.

Top

Select the best OCR Engine

The OCR API offers two different OCR engine with a different processing logic. We recommend that you try both and then use whatever engine gives you the best OCR result. You can use both OCR engines with our free online OCR service on the front page and with the OCREngine=1/2 parameter in your API call.

Features of OCR Engine 1:

  • - Supports more languages (including Asian languages like Chinese, Japanese and Korean)
  • - Faster
  • - Supports larger images
  • - Multi-Page TIFF scan support
  • - Parameter: OCREngine=1

Features of OCR Engine 2:

  • - Western Latin Character languages only (English, German, French,..)
  • - Language auto-detect. It does not matter what OCR language you select, as long as it uses Latin characters
  • - Usually better at single number OCR, single character OCR and alphanumeric OCR in general (e. g. SUDOKO, Dot Matrix OCR, MRZ OCR, Single digit OCR, Missing 1st letter after OCR, .. )
  • - Usually better at special characters OCR like @+-..
  • - Usually better with rotated text (Forum: Detect image spam)
  • - Image size limit 5000px width and 5000px height
  • - Parameter: OCREngine=2

Enterprise Support: Both OCR engines are available for offline, self-hosting as On-Premise OCR!

The returned OCR result JSON response is identical for both engines! You can switch between both engines as needed. The features that are not mentioned in this OCR engine comparison are the same for both engines, for example PDF OCR, detect orientation and receipt scanning support. If you have any question about using Engine 1 or 2, please ask in our OCR API Forum.

Top

The API returns results in JSON format. The result typically contains the ExitCode, Error details (if occurred) and a bunch of parsed results for the Image / PDF pages. Please check below the response the Web API returns and definition of various parameters. The illustration below shows success and error responses.

KeyValueDescription
ParsedResultsOCR resultsThe OCR results for the image or for each page of PDF. For PDF: Each page has its own OCR result and error message (if any)
OCRExitCodeInteger The exit code shows if OCR completed successfully, partially or failed with error
1: Parsed Successfully (Image / All pages parsed successfully)
2: Parsed Partially (Only few pages out of all the pages parsed successfully)
3: Image / All the PDF pages failed parsing (This happens mainly because the OCR engine fails to parse an image)
4: Error occurred when attempting to parse (This happens when a fatal error occurs during parsing )
IsErroredOnProcessingtrue/falseIf an error occurs when parsing the Image / PDF pages
ErrorMessageTextThe error message of the error occurred when parsing the image
ErrorDetailsTextDetailed error message
SearchablePDFURLLinkSee Searchable PDF
IMAGE / PAGE PARSING RESULT
FileParseExitCodeExit code for each parsed result The exit code returned by the parsing engine
0: File not found
1: Success
-10: OCR Engine Parse Error
-20: Timeout
-30: Validation Error
-99: Unknown Error
ParsedTextParsed TextThe parsed text for an image
TextOverlayOverlay data for the text in the image/pdfOnly if 'isOverlayRequired' is set to 'True'
LinesAn array of lines in the overlay textThis contains an array of all the lines. Each line will contain an array of words
WordsAn array of words in a lineThis contains the words with the specific details of a word like its text and position
WordTextText of the wordThis contains the text of that specific word
LeftDistance of word from left (in pixels (px))Contains the distance (in px) of the word from the left edge of the imagee
TopDistance of word from top (in px)Contains the distance (in px) of the word from the top edge of the image
HeightHeight of the wordContains the height (in px) of the word
WidthWidth of the wordContains the width (in px) of the word
MaxHeightMaximum height of the lineContains the height (in px) of the line
MinTopMinimum distance of the line from the top edge of imageContains the distance (in px) of the line from the top edge in the original size of image
HasOverlayOverlay is present or notTrue/False depending upon if the overlay for the parsed result is present or not
ErrorMessageTextError message returned by the parsing engine
ErrorDetailsTextDetailed error message returned from the parsing engine for debugging purposes
Top

You can create searchable PDFs (sometimes also called Sandwich PDFs) directly via the API. The PDF is returned as download link in the API JSON response the form of 'SearchablePDFURL': '..'. The download link is valid for one hour, after this time the document is deleted from our OCR servers.

The isCreateSearchablePdf = true switch triggers the generation of the searchable PDF. By default, the added text layer is visible - - this is ideal for testing the result as you can compare the OCR'ed output directly with the scan image. By adding isSearchablePdfHideTextLayer = true you make the text layer invisible. Creating a searchable PDF from the OCR result takes additional processing time, so you should only activate this feature if you need the OCR result in PDF format.

NOTE: You must use both parameters, isCreateSearchablePdf = true and isSearchablePdfHideTextLayer = false or true, otherwise the generated PDF contains no text layer.

When used with the free OCR API tier, the generated PDF contains a watermark 'Generated by OCR.space' in the lower right corner. With the PRO OCR API, no watermark is added to the PDF.

Top

The fastest way to test the OCR API is to make a GET call - just copy the URL in your web browser.

Test API with the Postman App

Getting started: Use the free Postman app for Windows, Mac and Linux to test the OCR API and play with the different parameters.

Tip: If you have Postman installed you can click the 'Run in Postman' button above to import a set of six API test calls to Postman. The samples use the 'helloworld' api key and are ready to run without any further edits.

(a) Provide image/PDF to be OCR'ed via URL

The screenshots below show the settings for sending the image/PDF via a URL. Filevault password not working. Note that the encoding is set to multipart/form-data.

In all cases (file upload via URL, file or base64) the api key (password) is sent in header:

(b) Upload image/PDF to be OCR'ed from your server/PC

Same Postman app, but this time we are using the 'File' setting to upload the image or PDF.

(c) Send image as Base64 string

Same Postman app, but this time, we are using the 'Base64Image' parameter to send the image as string. Tipard mac video converter ultimate 9 2 16 download free.

Tip: Make sure there is no extra 'new line' after pasting a base64 string into Postman. If there is, the API will (rightfully) return a 'Not a valid base64 image.' error.

Test BASE64 strings

The links open a text file in the browser: Image Base64 String, TIFF Base64 String, PDF as Base64 string. You can copy and paste the content of these text files directly into the 'base64image' field of Postman, or any other test code.

Important: The base64 string must start with the content type of the document. For example use  string here,  string here or for PDF documents data:application/pdf;base64,data string here. Most online image to base64 conversion services do not add this header, they just provide the raw data string. Thus you must add it manually when you use such strings for testing.

cURL command-line

(a) Provide image/PDF to be OCR'ed via URL

curl https://api.ocr.space/Parse/Image -H 'apikey:helloworld' --data 'isOverlayRequired=true&url=http://dl.a9t9.com/blog/ocr-online/screenshot.jpg&language=eng'

curl is an open source command line tool and library for transferring data with URL syntax. The libcurl library is portable. It builds and works identically on nearly any platform (Windows, Mac, Linux,..).

(b) Upload image/PDF to be OCR'ed from your server/PC

curl -H 'apikey:helloworld' --form 'file=@screenshot.jpg' --form 'language=eng' --form 'isOverlayRequired=true' https://api.ocr.space/Parse/Image

Note: @screenshot.jpg assumes an image with name 'screenshot.jpg' is in the same directory as cURL.exe. Note that the isOverlayrequired (default: no) and the language (default: eng) parameters are optional.

(c) Send image as string in Base64 format

curl -H 'apikey:helloworld' --form 'base64Image= [Long string here ]' --form 'language=eng' --form 'isOverlayRequired=false' https://api.ocr.space/parse/image

The base64 string in this example is truncated. You can download the full command line as Windows batch file from GitHub.

We have some test base64 strings available for download.

Top

C# (Visual Studio Project)

There is a ready-to-use Visual Studio C# sample project for using the OCR API from C# on GitHub.

The test app allows you to upload and test any image quickly with the OCR API.

For a real-life example, look at the popular 'ShareX' productivity tool:
ShareX uses the OCR.space PRO API and the full C# source code is available.

Top

iOS: Objective-C and Swift

The user-provided code snippets for Objective-C and Swift are a good starting point for iPhone apps with OCR features.

Top

Android: Java

Using Android? Look at this Android sample app that uses the free OCR API. The Java app shows how to call the API using HttpsURLConnection from user 'bsuhas'. And here is another, different Java repo from user 'Globalizer'. Thanks to both for providing this code snippet.

Top

PHP OCR API Demo Web App

For PHP we have a complete, ready-to-run demo web app that allows the user to select a document and then uploads the image or PDF document to the OCR API.


You find the full source code at Github .

Top

Python

Here is an example of how to access the API from Python using the requests.post command.

The full source code can be found on GitHub (thanks to user 'Zaargh' for providing this code snippet). Another Python wrapper for our OCR SDK is available from GitHub user a4fr (thanks to everyone for creating code snippets).

Top

AutoHotKey (AHK)

AHK is a popular Windows Macro Recorder. For windows automation projects that require to recognize text on images you can connect to the OCR API with CreateFormData(PostData, ContentType, oForm). This AHK forum post has the details.

Top

C++/QT

Using C++? Jhiroka from UCLA shared this example with us: C++/QT OCR API sample app.

If you are using the C++ Casablanca Library for the HTTP POST call, note that you need to url encode the image data on top of Base64 encoding. The C++ library Casablanca does not seem to do this automatically (unlike Postman does), so use the function web::uri::encode_data_string to encode the file data after Base64 encoding the request.

Top

Go

User Matteo made a Github repository with a Go module for the OCR API.

Top

Ruby

Using Ruby? Suyesh shared this Ruby gem (library) with us: OCR API Ruby gem.

Perl

Using Perl? Then have a look at this OCR API user submitted Perl OCR.space module.

Powershell

We have a Powershell OCR code snippet. This includes downloading the generated sandwich PDF.

Top

Javascript

Chrome extension

The open-source Copyfish Chrome, Edge and Firefox extension uses our OCR API. You find its Javascript source code here. This includes code that shows how to process the returned text overlay data. Note the Copyfish extension uses the PRO OCR API version.

A zipper 1 35. Test it: You can install the Copyfish OCR extension in Chrome, Edge, and Firefox.

Top

NPM/Node.js

The latest OCR API Node.JS wrapper is from user DavideViolante. It allows you to specify the OCR Space API endpoints (Free and PRO).
Older Node.JS wrappers: User Dennis.K published a NPM package for the OCR API and Anthony Luzquiños released an updated NPM package for the OCR API.

Top

Jquery

This is a JQuery example showing how to make a request to the api using AJAX and get the image results for processing.

Top

iText is an OCR tool which could recognize text from any image.

You can use iText to extract text from PDF, document in paper, page in a book and any other images.

1. Easily Select Image

iText supports a variety of ways to select images, the operation is very convenient.

1.1 Capture Screen

iText has built-in screen capture tool. Just press the shortcut ⇧⌘1, capture any area on the screen, you can extract the text in it.

Tips: The recognized text has been copied to the system clipboard. You can paste directly.

1.2 Drag the Image to Menubar Icon

For example, when you see an image in Twitter and want to extract the text or number inside, just drag the image to iText's menubar icon, you will get what you want.

Itext pro 1 2 5 – ocr tool software free

1.3 Choose Image File

Of course, you can also select a picture file to recognize. However, dragging mentioned above is preferred in this case.

1.4 Continuously Recognize

For example, taking screenshot of different positions in PDFs, iText will recognize the text in turn and automatically concatenate the results.

2 Accurately Recognize Text

Do you have this experience: You want to extract the text from a picture and found that there are some errors in the recognized text. As a result, the time to manually modify these errors is longer than the time to type them in a computer.

Obviously, accuracy of recognition is very important, that's why I work hard on it.

2.1 Powered by Google

First of all, I excluded offline recognition libraries, as the offline libraries are dead and can't improve itself. Next, in many online OCR services, I compared the products of Microsoft, Google, and others.

Finally, I chose Google's service as it's so powerful, which could recognize 50+ languages.

  • For normal natural language, such as a page of a book, press release, recognition result is amazingly accurate, even up to 100%.
  • For complex typesetting, especially with special characters (e.g., program source code), the recognition result isn't that good, You may need to manually modify the results after recognition.
    • E.g, for just a vertical line, the machine can not distinguish between the lowercase l, or uppercase I (by the way, can you identify them?); In contrast, machine needs to understand the context to optimize the result. But now it's too hard for machine to understand non-natural language like program source code.
Itext Pro 1 2 5 – Ocr Tool Software

1.3 Choose Image File

Of course, you can also select a picture file to recognize. However, dragging mentioned above is preferred in this case.

1.4 Continuously Recognize

For example, taking screenshot of different positions in PDFs, iText will recognize the text in turn and automatically concatenate the results.

2 Accurately Recognize Text

Do you have this experience: You want to extract the text from a picture and found that there are some errors in the recognized text. As a result, the time to manually modify these errors is longer than the time to type them in a computer.

Obviously, accuracy of recognition is very important, that's why I work hard on it.

2.1 Powered by Google

First of all, I excluded offline recognition libraries, as the offline libraries are dead and can't improve itself. Next, in many online OCR services, I compared the products of Microsoft, Google, and others.

Finally, I chose Google's service as it's so powerful, which could recognize 50+ languages.

  • For normal natural language, such as a page of a book, press release, recognition result is amazingly accurate, even up to 100%.
  • For complex typesetting, especially with special characters (e.g., program source code), the recognition result isn't that good, You may need to manually modify the results after recognition.
    • E.g, for just a vertical line, the machine can not distinguish between the lowercase l, or uppercase I (by the way, can you identify them?); In contrast, machine needs to understand the context to optimize the result. But now it's too hard for machine to understand non-natural language like program source code.

Welcome to have a try and feel how accurate the recognition result is.

2.2 Optimize the Recognition Results

OCR services could accurately recognize the text in image, but not that good for further recognition, e.g., paragraph recognition, etc.

So, iText includes its own algorithm to optimize the result, eg.,

  • Automatically identify paragraphs.
  • Remove extra spaces between English words and punctuation characters.
  • Capitalize the first letter for English.

If you find that the optimization is not good, welcome to send the image to me. I will optimize the algorithm corresponding to the image. Thanks in advance.

2.3 Preview the Original Image for Proofing

As current OCR technology cannot always 100% recognize the text, it's necessary to review the original image to modify the result. In iText, you could:

  • Drag the result window nearby the image.
  • Show image in left of the result window.

And then, you will feel easy to update the result.

2.4 Auto Hide Recognition Result

Itext Pro 1 2 5 – Ocr Tool Software Download

Since iText's recognition results are very accurate and have been copied to the clipboard, there is no need to edit or copy the text after recognition. At this point, you can turn on the 'Auto Hide' option as shown above, and the recognition result window will be automatically hidden after 3s, which is very convenient.

In another side, if you need to edit a recognition result temporarily, just move the mouse to the result window, and the auto hide function will be ignored this time. In addition, the window will not be automatically hidden when the 'Pin' option is turned on.

3 Automatically Translate

After recognizing text from image, iText could automatically translate them to 100+ languages, powered by Google.

Itext Pro 1 2 5 – Ocr Tool Software Download

Download

You can recognize text from images 20 times for free each month, or subscribe iText Pro to unlimitedly recognize text from images.

If you also feel iText is helpful, welcome to rate iText on Mac App Store and leave a small review.

If you had any problem using iText or have any suggestions for improvements, please feel free to contact me.

Itext Pro 1 2 5 – Ocr Tool Software Free

I'm looking forward to hearing from you.





broken image