Skip to content

Conversation

viren-nadkarni
Copy link
Member

@viren-nadkarni viren-nadkarni commented Mar 4, 2025

Changes

This PR adds support for 9 new languages in Transcribe:

  • Catalan
  • Czech
  • Gujarati
  • Kazakh
  • Korean
  • Telugu
  • Polish
  • Uzbek
  • Ukrainian

It also bumps model versions for following languages, which should give an improved transcription accuracy:

  • Chinese
  • Farsi
  • Spanish
  • Italian
  • Russian
  • Vietnamese

@viren-nadkarni viren-nadkarni self-assigned this Mar 4, 2025
Copy link

github-actions bot commented Mar 4, 2025

LocalStack Community integration with Pro

 2 files  ±    0   2 suites  ±0   1m 12s ⏱️ - 1h 51m 23s
23 tests  - 4 089  21 ✅  - 3 771  2 💤  - 318  0 ❌ ±0 
25 runs   - 4 089  21 ✅  - 3 771  4 💤  - 318  0 ❌ ±0 

Results for commit d6c60c3. ± Comparison against base commit 2e9b26c.

This pull request removes 4089 tests.
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_lambda_dynamodb
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_opensearch_crud
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_search_books
tests.aws.scenario.bookstore.test_bookstore.TestBookstoreApplication ‑ test_setup
tests.aws.scenario.kinesis_firehose.test_kinesis_firehose.TestKinesisFirehoseScenario ‑ test_kinesis_firehose_s3
tests.aws.scenario.lambda_destination.test_lambda_destination_scenario.TestLambdaDestinationScenario ‑ test_destination_sns
tests.aws.scenario.lambda_destination.test_lambda_destination_scenario.TestLambdaDestinationScenario ‑ test_infra
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_prefill_dynamodb_table
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_stepfunctions_input_recipient_list[step_function_input0-SUCCEEDED]
tests.aws.scenario.loan_broker.test_loan_broker.TestLoanBrokerScenario ‑ test_stepfunctions_input_recipient_list[step_function_input1-SUCCEEDED]
…

♻️ This comment has been updated with latest results.

@viren-nadkarni viren-nadkarni added the semver: minor Non-breaking changes which can be included in minor releases, but not in patch releases label Mar 10, 2025
@viren-nadkarni viren-nadkarni changed the title Transcribe: New langauges Transcribe: New language models Mar 10, 2025
@viren-nadkarni viren-nadkarni marked this pull request as ready for review March 10, 2025 13:50
@viren-nadkarni viren-nadkarni removed the request for review from ackdav March 10, 2025 13:50
Copy link
Contributor

@sannya-singal sannya-singal left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding new language models and updating some existing ones 🚀 and updating our hugging face models with the newer ones!

@viren-nadkarni viren-nadkarni merged commit 703daf2 into master Mar 13, 2025
37 checks passed
@viren-nadkarni viren-nadkarni deleted the transcribe-new-langs branch March 13, 2025 06:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
semver: minor Non-breaking changes which can be included in minor releases, but not in patch releases
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants