Wiki:
https://fedoraproject.org/wiki/Changes/ibus-speech-to-text_pywhispercpp

Discussion Thread: https://discussion.fedoraproject.org/t/179397

**This is a proposed Change for Fedora Linux.**
This document represents a proposed Change. As part of the Changes process,
proposals are publicly announced in order to receive community feedback.
This proposal will only be implemented if approved by the Fedora
Engineering Steering Committee.

== Summary ==

ibus-speech-to-text 0.7.0 introduces support for OpenAI's Whisper engine
via pywhispercpp (python bindings of WhisperCpp) in addition to the
existing Vosk engine.

== Owner ==

* Name: [[User:matiwari|Manish Tiwari]]
* Email: [email protected]

== Detailed Description ==

Key ibus-speech-to-text-0.7.0 Changes:
* ibus-speech-to-text provides a new backend engine option allowing users
to select between Vosk and Whisper engine
*  It has a new GStreamer engine to integrate WhisperCpp into
ibus-speech-to-text pipeline
* It supports multiple Whisper models, including locally installed models
and online models downloaded from Hugging Face
* Automatic locale based model selection when possible
* UI updates to allow backend switching and model management from setup tool

== Feedback ==



== Benefit to Fedora ==

This package will bring several benefits to Fedora:
* Higher accuracy speech recognition
* Greater flexibility by allowing users to choose between multiple backends

== Scope ==

* Proposal owners:
** Package pywhispercpp ([
https://bugzilla.redhat.com/show_bug.cgi?id=2420917]) [done]
* Other developers:  N/A
* Release engineering: [https://pagure.io/releng/issues #Releng issue
number]
* Policies and guidelines: N/A (not needed for this Change)
* Trademark approval: N/A (not needed for this Change)
* Alignment with the Fedora Strategy:

== Upgrade/compatibility impact ==

Existing ibus-speech-to-text installations will continue to use the Vosk
backend by default. No existing configuration or functionality is removed.

== Early Testing (Optional) ==

Do you require 'QA Blueprint' support? N

== How To Test ==

=== Functionality Test ===
1. Install required packages:`sudo dnf install ibus-speech-to-text`

2. Restart IBus using `ibus restart` command

3. Add `Speech To Text` in input sources

4. Launch the `IBus STT Setup` tool from the preferences for a
configuration and to download a language model

5. From `Setup tool` select `Whisper` as a backend then select and download
Whisper model from list of available model for each locale

== User Experience ==

Users will see a new backend option in ibus-speech-to-text settings with a
variety of Whisper models.

== Dependencies ==

* pywhispercpp

== Contingency Plan ==

* Contingency mechanism: N/A (Not a system wide change)
* Contingency deadline: N/A (Not a system wide change)
* Blocks release? N/A (Not a system wide change)

== Documentation ==

N/A (Not a system wide change)

== Release Notes ==

\n\nibus-speech-to-text now supports the WhisperCpp speech recognition
engine via pywhispercpp, providing improved accuracy and multilingual
support.
-- 
_______________________________________________
devel-announce mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/[email protected]
Do not reply to spam, report it: 
https://pagure.io/fedora-infrastructure/new_issue
-- 
_______________________________________________
devel mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/[email protected]
Do not reply to spam, report it: 
https://pagure.io/fedora-infrastructure/new_issue

Reply via email to