Commit Graph

221 Commits

Author SHA1 Message Date
Chocobozzz
07058e17c3
Better whisper-ctranslate vad filter option 2024-07-03 17:08:32 +02:00
Chocobozzz
efa58350eb
Fix transcription tests 2024-07-03 16:14:54 +02:00
Chocobozzz
8ab6f23a00
Don't block video update on storyboard generation 2024-07-03 15:40:10 +02:00
Chocobozzz
c289c86741
Add vad_filter to ctranslate transcriber
Helps us to correctly detect the language if there is no voice in the
first 30 seconds

Also helps to lower hallucinations
2024-07-03 15:23:26 +02:00
Chocobozzz
2d26eff129
Don't create pendingTranscription twice 2024-07-03 08:52:05 +02:00
Chocobozzz
0870a3fe51
Fix check logs params tests 2024-07-02 13:16:59 +02:00
Chocobozzz
9ee467b9cd
Better whisper logging 2024-07-02 09:01:54 +02:00
Chocobozzz
3a71086e35
Correctly handle transcription conflicts 2024-06-28 14:42:19 +02:00
Chocobozzz
0b30e58f6f
Remove verbose option from transcription
Can be specified on-demand using NODE_DEBUG=execa env variable
2024-06-28 08:51:35 +02:00
Chocobozzz
b66963fe6f
Runner can choose job type 2024-06-28 08:44:59 +02:00
Chocobozzz
fd4831e502
Metadata to know if the caption is auto generated 2024-06-28 08:44:59 +02:00
Chocobozzz
1bfb791e05
Integrate transcription in PeerTube 2024-06-28 08:44:58 +02:00
lutangar
ef14cf4a5c
feat(transcription): groundwork
chore: fiddling around some more

chore: add ctranslate2 and timestamped

chore: add performance markers

chore: refactor test

chore: change worflow name

chore: ensure Python3

chore(duration): convert to chai/mocha syntahx

chore(transcription): add individual tests for others transcribers

chore(transcription): implement formats test of all implementations

Also compare result of other implementation to the reference implementation

chore(transcription): add more test case with other language and models size and local model

chore(test): wip ctranslate 2 adapat

chore(transcription): wip transcript file and benchmark

chore(test): clean a bit

chore(test): clean a bit

chore(test): refacto timestamed spec

chore(test): update workflow

chore(test): fix glob expansion with sh

chore(test): extract some hw info

chore(test): fix async tests

chore(benchmark): add model info

feat(transcription): allow use of a local mode in timestamped-whisper

feat(transcription): extract run and profiling info in own value object

feat(transcription): extract run concept in own class an run more bench

chore(transcription): somplify run object only a uuid is now needed and add more benchmark scenario

docs(transcription): creates own package readme

docs(transcription): add local model usage

docs(transcription): update README

fix(transcription): use fr video for better comparison

chore(transcription): make openai comparison passed

docs(timestamped): clea

chore(transcription): change transcribers transcribe method signature

Introduce whisper builtin model.

fix(transcription): activate language detection

Forbid transcript creation without a language.
Add `languageDetection` flag to an engine and some assertions.

Fix an issue in `whisper-ctranslate2` :
https://github.com/Softcatala/whisper-ctranslate2/pull/93

chore(transcription): use PeerTube time helpers instead of custom ones

Update existing time function to output an integer number of seconds and add a ms human-readable time formatter with hints of tests.

chore(transcription): use PeerTube UUID helpers

chore(transcription): enable CER evaluation

Thanks to this recent fix in Jiwer <3
https://github.com/jitsi/jiwer/issues/873

chore(jiwer): creates JiWer package

I'm not very happy with the TranscriptFileEvaluator constructor... suggestions ?

chore(JiWer): add usage in README

docs(jiwer): update JiWer readme

chore(transcription): use FunMOOC video in fixtures

chore(transcription): add proper english video fixture

chore(transcription): use os tmp directory where relevant

chore(transcription): fix jiwer cli test reference.txt

chore(transcription): move benchmark out of tests

chore(transcription): remove transcription workflow

docs(transcription): add benchmark info

fix(transcription): use ms precision in other transcribers

chore(transcription): simplify most of the tests

chore(transcription): remove slashes when building path with join

chore(transcription): make fromPath method async

chore(transcription): assert path to model is a directory for CTranslate2 transcriber

chore(transcription): ctranslate2 assertion

chore(transcription): ctranslate2 assertion

chore(transcription): add preinstall script for Python dependencies

chore(transcription): add download and unzip utils functions

chore(transcription): add download and unzip utils functions

chore(transcription): download & unzip models fixtures

chore(transcription): zip

chore(transcription): raise download file test timeout

chore(transcription): simplify download file test

chore(transcription): add transcriptions test to CI

chore(transcription): raise test preconditions timeout

chore(transcription): run preinstall scripts before running ci

chore(transcription): create dedicated tmp folder for transcriber tests

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): use short video for local model test

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): setup verbosity based on NODE_ENV value
2024-06-28 08:43:40 +02:00
Chocobozzz
ef0a6b267b
Add server restart test 2024-06-27 11:56:48 +02:00
Chocobozzz
985e79f61e
Fix lint and tests 2024-06-21 15:28:09 +02:00
Chocobozzz
990266f149
Owner must not be able to approve its own comment 2024-06-21 10:27:03 +02:00
Chocobozzz
0adbc73eb9
Fix HLS audio desync on some videos 2024-06-12 10:58:07 +02:00
Chocobozzz
585050821a
Also prune storyboards and actor images 2024-06-05 15:47:37 +02:00
Chocobozzz
2b189131fa
Add house-keeping script 2024-06-05 15:35:58 +02:00
Chocobozzz
9b483bcb78
Fix tests build 2024-06-05 09:12:43 +02:00
Chocobozzz
3427330611
Support update object storage urls 2024-06-05 09:01:40 +02:00
Chocobozzz
96b9748585
Support CLI move of original video file 2024-06-04 09:08:24 +02:00
Chocobozzz
54c140c800
Support object storage in prune script
Also prune original files and user exports
2024-06-03 16:37:44 +02:00
Chocobozzz
4a67994775 Use my-embed component 2024-05-30 16:31:16 +02:00
Chocobozzz
29329d6c45 Implement auto tag on comments and videos
* Comments and videos can be automatically tagged using core rules or
   watched word lists
 * These tags can be used to automatically filter videos and comments
 * Introduce a new video comment policy where comments must be approved
   first
 * Comments may have to be approved if the user auto block them using
   core rules or watched word lists
 * Implement FEP-5624 to federate reply control policies
2024-05-29 15:03:14 +02:00
Chocobozzz
4ea659d569
Handle videos with FPS < 1 2024-05-29 08:56:53 +02:00
Chocobozzz
9b5c7a8dc6
Merge branch 'release/6.1.0' into develop 2024-05-17 11:36:43 +02:00
Chocobozzz
cb88f25ffa
Don't count deleted comments for stats 2024-05-17 11:15:37 +02:00
kontrollanten
1eb8fc2c06 fix(server/plugins): avoid duplicate settings
Filter settings so that the name property is unique.

closes #6356
2024-05-17 10:43:47 +02:00
Chocobozzz
ad5f82495f
Add privacy remover plugin test 2024-05-16 09:41:39 +02:00
Chocobozzz
8d22b9100b
Fix import tests 2024-04-29 08:01:57 +02:00
Chocobozzz
afb28272f5
Check video rights before providing AP information 2024-04-26 10:09:38 +02:00
Chocobozzz
b8635c2606
Compact json-ld AP objects 2024-04-25 11:21:55 +02:00
Chocobozzz
bce0f2f11b
Fix total video file size 2024-04-22 11:48:05 +02:00
Chocobozzz
6f3deaee2e
Fix import tests 2024-04-11 08:13:35 +02:00
Chocobozzz
3bfecf4890
Update runner version 2024-04-04 16:33:06 +02:00
Chocobozzz
5cb3e6a0b8
Use sessionId instead of IP to identify viewer
Breaking: YAML config `ip_view_expiration` is renamed `view_expiration`
Breaking: Views are taken into account after 10 seconds instead of 30
seconds (can be changed in YAML config)

Purpose of this commit is to get closer to other video platforms where
some platforms count views on play (mux, vimeo) or others use a very low
delay (instagram, tiktok)

We also want to improve the viewer identification, where we no longer
use the IP but the `sessionId` generated by the web browser. Multiple
viewers behind a NAT can now be able to be identified as independent
viewers (this method is also used by vimeo or mux)
2024-04-04 16:27:40 +02:00
kontrollanten
6f6abcabfb
feat(plugins): add peertubeHelpers.loadByIdOrUUIDWithFiles (#6302) 2024-04-04 08:20:09 +02:00
kontrollanten
cd42491cf0
feat: add support for sub routes under /my-account (#6218)
* feat: add support for sub routes under /my-account

closes #6217

* feat(plugins/client-routes): page titles

Add support for adding custom page titles in client routes.

* fix(client/PluginPages): reload component upon URL change

* Styling

* docs(plugins): update registerClientRoute

---------

Co-authored-by: Chocobozzz <me@florianbigard.com>
2024-04-04 08:17:59 +02:00
Chocobozzz
4b18b0880d
Fix tests 2024-04-03 16:40:43 +02:00
Chocobozzz
7816fa4d48
Fix lint 2024-04-03 16:16:06 +02:00
Chocobozzz
61fec4e4ef
Better seconds to time formatting 2024-04-03 14:50:30 +02:00
Chocobozzz
9b70c8e7e8
Do not take into account empty view sections 2024-04-03 14:25:21 +02:00
kontrollanten
1b323f4f65 feat(plugins): add req.rawBody for routes
Stripe webhooks endpoints requires to read the raw request body.
https://docs.stripe.com/webhooks#verify-webhook-signatures-with-official-libraries
2024-04-02 11:06:34 +02:00
Chocobozzz
107e6e73a6
Fix import tests 2024-04-02 11:05:00 +02:00
Chocobozzz
7eb0189b73
Fix config test with parallel tests 2024-03-28 10:03:47 +01:00
Chocobozzz
81963e0c6e
Simplify config test 2024-03-28 09:20:23 +01:00
Chocobozzz
817f754c22
Remove difficult to maintain config update method 2024-03-28 09:10:36 +01:00
Chocobozzz
11521f231f
Generate small versions of banners too 2024-03-27 15:08:09 +01:00
kontrollanten
aaa5acbb0c feat(plugins): add filter:api.user.me.get.result
relates to #6219
2024-03-27 13:32:53 +01:00