Convert PDF file to images and recognize text using Aspose Cloud APIs


Aspose Cloud APIs offer a variety of features for creating, manipulating and converting documents in different file formats. Whether it's a workbook or a presentation, a PDF file or a set of images, the Aspose Cloud file format APIs have a solution for many kinds of document manipulation requirements. You can use these APIs in your applications to process documents in the cloud.

We provide SDK and REST examples for these APIs in different programming languages, such as .NET, Ruby, Java and PHP, that you can use in your application. An important aspect of Aspose Cloud APIs is that you can integrate multiple file format APIs to combine their features and achieve the desired results.

There might be scenarios where you want to get a PDF file as images using Aspose.PDF Cloud and then extract text from those images using Aspose.OCR Cloud. Aspose.PDF Cloud is a REST API for creating and editing PDF files and converting them to other file formats. Aspose.OCR Cloud is a REST API for optical character recognition and document scanning. Let's have a look at how you can use these two REST APIs together to work with PDF files and text recognition.

You can convert a PDF file to images using the Aspose.PDF Cloud API. The conversion runs in the cloud, and you can convert either the whole PDF file or only the pages you need. Supported image formats include JPEG, PNG, GIF, BMP and TIFF.
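As a rough illustration of a page-to-image request, the Python sketch below only builds the request URL and sends nothing. The base URL, the path layout, the `format`, `width` and `height` parameter names, and the 1-based page numbering are all assumptions made for this example; please take the real values from the Aspose.PDF Cloud documentation.

```python
from urllib.parse import quote, urlencode

# Assumed placeholder base URL and path layout, not the documented ones.
BASE_URL = "https://api.example.com/v1.1"


def page_image_url(pdf_name, page_number, image_format="png", width=None, height=None):
    """Build a (hypothetical) URL for rendering one PDF page as an image."""
    if page_number < 1:
        # Assumption: pages are numbered from 1.
        raise ValueError("page numbers start at 1")
    params = {"format": image_format}
    # Send a size only when both dimensions are given; otherwise the
    # service is expected to use its default size.
    if width is not None and height is not None:
        params["width"] = width
        params["height"] = height
    path = f"/pdf/{quote(pdf_name)}/pages/{page_number}"
    return f"{BASE_URL}{path}?{urlencode(params)}"


default_size_url = page_image_url("sample.pdf", 1)
custom_size_url = page_image_url("my file.pdf", 2, "jpeg", 800, 1000)
```

Leaving out `width` and `height` corresponds to the default-size conversion, and passing both corresponds to the specified-size conversion.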

Once you have converted the PDF file to images, you can use the Aspose.OCR Cloud REST API to recognize text in the images and save it to a database. Aspose.OCR Cloud can also report font attributes of the extracted text, such as font type, font style and font size.
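The recognize-and-save step could look something like the following Python sketch. It stores one row of text per page in an SQLite database. The `recognize_text` callable is hypothetical: in a real application it would wrap the Aspose.OCR Cloud request, while here a stand-in is used so the example runs offline. The table layout is likewise just an assumption.

```python
import sqlite3


def store_recognized_text(db, pdf_name, page_images, recognize_text):
    """OCR each page image and save the text, one row per page.

    recognize_text is a hypothetical callable (image bytes -> str) that
    would wrap the OCR request in a real application.
    """
    db.execute(
        "CREATE TABLE IF NOT EXISTS page_text ("
        "pdf_name TEXT, page INTEGER, content TEXT, "
        "PRIMARY KEY (pdf_name, page))"
    )
    for page, image in enumerate(page_images, start=1):
        text = recognize_text(image)
        # Re-running the pipeline overwrites the earlier text for a page.
        db.execute(
            "INSERT OR REPLACE INTO page_text VALUES (?, ?, ?)",
            (pdf_name, page, text),
        )
    db.commit()


def fake_recognize(image_bytes):
    # Stand-in recognizer for the example; it performs no real OCR.
    return image_bytes.decode("ascii").upper()


db = sqlite3.connect(":memory:")
store_recognized_text(db, "sample.pdf", [b"page one", b"page two"], fake_recognize)
rows = db.execute("SELECT page, content FROM page_text ORDER BY page").fetchall()
```

Passing the recognizer in as a parameter keeps the storage logic independent of whichever SDK or REST client ends up making the OCR call.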

Aspose.PDF Cloud Examples:

Convert PDF page to image

Convert a PDF page to image with default size

Convert a PDF page to image with specified size

Aspose.OCR Cloud Examples:

Extract text from images

Aspose.PDF Cloud can convert PDF files to images, and it can convert a PDF page to an image with either the default size or a size you specify. You can then process the images with other Aspose Cloud APIs; for instance, Aspose.OCR Cloud can recognize characters in images in different languages such as English, French and Spanish. By combining these two REST APIs, you can handle both image extraction and character recognition. For more information, please refer to the documentation of Aspose.PDF Cloud and Aspose.OCR Cloud. Please write to us if you have any queries or requirements for combining the REST APIs. Stay tuned to our blogs for more updates and announcements.