Integration with Speechmatics #38

Closed
rkpujari opened this issue Nov 30, 2018 · 13 comments
Labels
good first issue Good for newcomers help wanted Extra attention is needed Module / Component A self contained module or component Speech To Text Adapters Speech To Text Adapters

Comments

@rkpujari

The editor looks great. Thanks for this. I am looking for a Speechmatics integration for it. Are there any plans to add one? If not, some documentation on how to integrate it into this editor would be helpful. Thanks in advance.

@pietrop pietrop added the question Further information is requested label Nov 30, 2018
@pietrop
Contributor

pietrop commented Nov 30, 2018

Thanks @rkpujari. The idea is to make it easier to create adapters and converters to import and export data into the TranscriptEditor component.

We are planning to write some more documentation and guides on how to do that. Should have something in the next couple of weeks.

For now, the adapters live in /src/lib/TranscriptEditor/TimedTextEditor/adapters. You can look at how the BBC Kaldi and autoEdit adapters are done to see what it would take to make a Speechmatics one.
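As a rough illustration of what such an adapter does, here is a minimal sketch that turns a Speechmatics-style transcript into speaker paragraphs with timed words. The input shape (`speakers` and `words` arrays whose entries have `name`, `time`, `duration` and `confidence`, possibly serialised as strings) and the output shape are assumptions made for this example; they are not taken from this thread, and the real adapters in the folder above produce the editor's own internal format. The function names are hypothetical.

```javascript
// Sketch only: the input and output shapes below are assumptions for illustration.
// Assumed input: { speakers: [{ name, time, duration }],
//                  words: [{ name, time, duration, confidence }] }
// Numeric fields may arrive as strings, so they are parsed explicitly.

// Convert one assumed Speechmatics word into a { start, end, text, confidence } object.
function normaliseWord(word) {
  const start = parseFloat(word.time);
  return {
    start,
    end: start + parseFloat(word.duration),
    text: word.name,
    confidence: parseFloat(word.confidence),
  };
}

// Group words into one paragraph per speaker segment.
// A word belongs to a segment if it starts inside that segment's time range.
// Punctuation handling is deliberately left out of this sketch.
function speechmaticsToParagraphs(json) {
  const words = json.words.map(normaliseWord);
  return json.speakers
    .map((segment) => {
      const segStart = parseFloat(segment.time);
      const segEnd = segStart + parseFloat(segment.duration);
      const segWords = words.filter((w) => w.start >= segStart && w.start < segEnd);
      return {
        speaker: segment.name,
        start: segStart,
        words: segWords,
        text: segWords.map((w) => w.text).join(' '),
      };
    })
    .filter((paragraph) => paragraph.words.length > 0); // drop empty segments
}
```

A real adapter would then map each paragraph onto whatever block structure the editor expects, which is what the existing BBC Kaldi and autoEdit adapters show.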

But, yes, will update this issue with more info soon.

@pietrop pietrop mentioned this issue Dec 14, 2018
4 tasks
@pietrop
Contributor

pietrop commented Dec 14, 2018

@rkpujari we have a first draft of a PR #51 with some instructions on how to create adapters for various STT services.

We are still tweaking it and reviewing it for clarity etc., but feedback is welcome if this is something you are currently looking at.

@rkpujari
Author

rkpujari commented Dec 18, 2018 via email

@pietrop
Contributor

pietrop commented Dec 18, 2018

If you want to do a Pull Request with the Speechmatics adapter we can add it to the main component.

We are also working on

  • having the possibility to turn speaker names and time stamps on or off
  • readjusting time stamps (word timings after text has been edited)
  • highlighting words having low confidence (below 6)
  • exporting the transcript (for now as plain text)
    ....
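To make a few of the items above concrete, here is a small sketch of flagging low-confidence words and exporting plain text with speaker names and time stamps switched on or off. It is not the component's implementation: the paragraph and word shapes (`speaker`, `start`, `text`, `confidence`), the function names, and the option names are all hypothetical, and the confidence threshold is passed in by the caller because its scale is not specified here.

```javascript
// Sketch only: data shapes, function names and options are assumptions for illustration.

// Mark each word whose confidence falls below a caller-supplied threshold.
function flagLowConfidence(words, threshold) {
  return words.map((word) => ({ ...word, lowConfidence: word.confidence < threshold }));
}

// Format a time in seconds as hh:mm:ss (fractions of a second are dropped).
function formatTimecode(seconds) {
  const total = Math.floor(seconds);
  const pad = (n) => String(n).padStart(2, '0');
  const hours = Math.floor(total / 3600);
  const minutes = Math.floor((total % 3600) / 60);
  return `${pad(hours)}:${pad(minutes)}:${pad(total % 60)}`;
}

// Export paragraphs as plain text; speaker names and time stamps are optional.
function exportPlainText(paragraphs, { showSpeakers = true, showTimecodes = true } = {}) {
  return paragraphs
    .map((paragraph) => {
      const prefix = [];
      if (showTimecodes) prefix.push(`[${formatTimecode(paragraph.start)}]`);
      if (showSpeakers) prefix.push(`${paragraph.speaker}:`);
      return [...prefix, paragraph.text].join(' ');
    })
    .join('\n\n');
}
```

Keeping the toggles as export options rather than baking them into the data means the same paragraphs can be rendered with or without labels.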

@rkpujari
Author

rkpujari commented Dec 18, 2018 via email

@pietrop
Contributor

pietrop commented Dec 18, 2018

OK, that's great, thanks for sharing this. I think that with your code example and the docs from Speechmatics it should be possible to add it as a separate adapter, following the guide in PR #51 - Guide: How to Create an Adapter - Draft

@pietrop
Contributor

pietrop commented Dec 20, 2018

As an update on the time estimate, @rkpujari: we just got the hide/show time-codes and speaker label functionality working in PR #56, and we should be able to add that to master soon.

For re-aligning text, I reckon end of January or February, to give a conservative estimate🤞

Although this might not be needed for most use cases, where users are just correcting and exporting text (?)

@rkpujari
Author

rkpujari commented Dec 21, 2018 via email

@pietrop
Contributor

pietrop commented Dec 21, 2018

Thanks @rkpujari ,

Yes, that is correct. If the user changes incorrect words (especially if they completely delete a word or a paragraph), then the time-codes will no longer be associated with the text, and they can no longer click on that text to jump to the corresponding point in the media.

And just to clarify, we are exploring ways to realign text. I've written some notes on the progress we have made on that front in #30. We should be able to try this out within the editor in the new year, so that we can do some more comprehensive tests and see if it's a valid solution.
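To illustrate the problem, here is one generic way to carry timings over to edited text. This is not the approach described in #30; it is a plain longest-common-subsequence diff with linear interpolation, chosen only because it is short. Unchanged words keep their original timings, and inserted or replaced words are spread evenly across the gap between their nearest timed neighbours. The word shape (`text`, `start`, `end`) and the function names are assumptions for this sketch.

```javascript
// Sketch only: a generic LCS-based realignment, not the project's method.

// Return a Map from index in b to index in a for one longest common subsequence.
function lcsMatches(a, b) {
  // dp[i][j] holds the LCS length of a[i..] and b[j..].
  const dp = Array.from({ length: a.length + 1 }, () => new Array(b.length + 1).fill(0));
  for (let i = a.length - 1; i >= 0; i--) {
    for (let j = b.length - 1; j >= 0; j--) {
      dp[i][j] = a[i] === b[j]
        ? dp[i + 1][j + 1] + 1
        : Math.max(dp[i + 1][j], dp[i][j + 1]);
    }
  }
  const matches = new Map();
  let i = 0;
  let j = 0;
  while (i < a.length && j < b.length) {
    if (a[i] === b[j]) {
      matches.set(j, i);
      i++;
      j++;
    } else if (dp[i + 1][j] >= dp[i][j + 1]) {
      i++;
    } else {
      j++;
    }
  }
  return matches;
}

// Assign timings to the words of editedText using the original timed words.
function realign(originalWords, editedText) {
  const tokens = editedText.split(/\s+/).filter(Boolean);
  const norm = (s) => s.toLowerCase();
  const matches = lcsMatches(originalWords.map((w) => norm(w.text)), tokens.map(norm));

  // Matched words inherit their original timings; the rest start untimed.
  const result = tokens.map((text, j) => {
    const m = matches.get(j);
    return m === undefined
      ? { text, start: null, end: null }
      : { text, start: originalWords[m].start, end: originalWords[m].end };
  });

  // Fill each run of untimed words by dividing the surrounding gap evenly.
  const lastEnd = originalWords.length ? originalWords[originalWords.length - 1].end : 0;
  let k = 0;
  while (k < result.length) {
    if (result[k].start !== null) {
      k++;
      continue;
    }
    let runEnd = k;
    while (runEnd < result.length && result[runEnd].start === null) runEnd++;
    const from = k > 0 ? result[k - 1].end : 0; // no earlier anchor: start of media
    const to = runEnd < result.length ? result[runEnd].start : Math.max(lastEnd, from);
    const step = (to - from) / (runEnd - k);
    for (let n = k; n < runEnd; n++) {
      result[n].start = from + step * (n - k);
      result[n].end = from + step * (n - k + 1);
    }
    k = runEnd;
  }
  return result;
}
```

Interpolated timings are only estimates, so heavily rewritten passages would still need a proper forced-alignment step to be accurate.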

@pietrop pietrop added help wanted Extra attention is needed good first issue Good for newcomers and removed question Further information is requested labels Dec 21, 2018
@pietrop pietrop added Module / Component A self contained module or component Speech To Text Adapters Speech To Text Adapters labels Jan 4, 2019
@murezzda
Contributor

murezzda commented Feb 7, 2019

I'm also very interested in this project and in a Speechmatics adapter for this tool. What are your current plans for officially adding support? I've already written a Speechmatics adapter while getting to know the code. I would happily share it and provide a pull request if you are interested.

@pietrop
Contributor

pietrop commented Feb 7, 2019

Thanks for reaching out @murezzda, yes a PR for the Speechmatics adapter would be great, thanks!

You can see this guide for how to add a new adapter, as well as the contributing section.

And feel free to reach out with any questions you might have.

@pietrop
Contributor

pietrop commented Feb 13, 2019

Addressed in PR #94

@pietrop pietrop closed this as completed Feb 13, 2019
@pietrop
Contributor

pietrop commented Feb 13, 2019

from version @bbc/[email protected] there is a speechmatics adapter as part of the component, thanks to @murezzda
