Thursday, 30 May 2019

Guidelines For How To Use AWS Transcribe

Amazon Transcribe is an automatic speech recognition (ASR) service. This service helps developers to add speech-to-text functionality to their applications. With the help of Amazon Transcribe API, you can study audio files saved in Amazon S3 and have the service return a text file of the transcribed speech. Additionally, you can transmit a live audio stream to Amazon Transcribe and get a stream of transcripts in real time. Also this can be utilized in different common applications, covering the transcription of customer service calls and producing subtitles on video and audio material.

Let’s go through the procedure to know about how practically you can use AWS Transcribe,

Create S3 bucket and upload sample audio file in it.

In this step, download a sample audio file, create a S3 bucket( create a bucket in a region same as of transcribe), then upload the sample file to the S3 bucket. Transcribe accesses these audio and video files for transcription from S3 buckets.

  • Go to S3 console.
  • In the S3 console choose Create bucket.
  • Create a bucket having unique bucket name. There are a number of other restrictions on S3 bucket names as well. Then select a region to create your bucket in.(should be same as region of AWS Transcribe).
  • Click on the newly created bucket and choose upload.
  • Upload sample file in S3 bucket.
  • Select the checkbox next to the sample file in your bucket. A file detail pane will be displayed for the sample file. Copy the link to the file and save it for use later in the tutorial.

Create Transcription Job

  • Go to Amazon Transcribe console.
  • On the Amazon Transcribe console main page, click on the Try Amazon Transcribe link or create job.
  • On the Create transcription job page, in the Name field, type the name of your job.
  • In the S3 input URL field, paste the link to the sample file in your S3 bucket.
  • Use the default values of English for the Language field and .mp3 for Format field.
  • Select Create to start your transcription job.

Review Transcription Results
  • When you click the Create button, you will be taken to the Transcription jobs screen which shows the status of sample-transcription-job. The status can be In progress, Complete, or Failed.
    When the status is Complete, click on the sample-transcription-job link in the Name column to view the transcription results.
  • Then you will see a sample-transcription-job details. Go to the Transcription panel to view the transcription job output. And in the JSON pane you can view the transcription results as it would be returned from the Transcribe API or AWS CLI.

Custom Vocabulary - Helps Amazon Transcribe recognize words and phrases that are specific to your application.

Channel Identification - Transcribe is able to process audio and video where each speaker is recorded on different channels and produce a single transcript with annotated by channel labels.

Speaker Identification - Transcribe recognizes when the speaker changes and attribute the transcribed text appropriately.

This AWS Transcribe feature can be used to transcribe audio files saved in standard formats such as MP3, WAV, including the timestamps for each word with this you can simply find the audio in the original source by looking for the text. We hope that this article will help you to understand the AWS Transcribe. If you have any queries related to this, then feel free to connect with us at www.cloud.in

No comments:

Post a Comment

Now Amazon Athena helps querying data in Amazon S3 Requester Pays buckets

Amazon Athena is an interactive query service which makes it simple to examine data straight in Amazon Simple Storage Service (Amazon S3)...