Add BOBSL, ISL-HS, Sign-BD datasets to the datasets table and references.bib #28

cleong110 · 2024-03-22T23:13:30Z

Closes #27 if accepted

AmitMY · 2024-03-23T08:38:48Z

src/datasets/BOBSL.json

+    "#signers": 37,
+    "license": "non-commercial authorized academics",
+    "licenseUrl": "https://www.bbc.co.uk/rd/projects/extol-dataset",
+    "contact": "Samuel Albanie albanie[AT]robots.ox.ac.uk"


I'd change the contact to a real email address.

Ah yes, I copy-pasted that from the project website. Good suggestion.
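For illustration, the requested fix could be applied with a small helper like the one below. This is only a sketch: the function name `deobfuscate_contact` and the placeholder address are hypothetical, and it assumes the only obfuscation is an `[AT]` marker as in the diff above.

```python
import re

# Matches "[AT]" (any case), with optional spaces inside or around the brackets.
_AT_MARKER = re.compile(r"\s*\[\s*AT\s*\]\s*", re.IGNORECASE)


def deobfuscate_contact(contact: str) -> str:
    """Replace an '[AT]' marker with '@' so the contact is a real address."""
    return _AT_MARKER.sub("@", contact)


# Placeholder name and address, not taken from the dataset file.
fixed = deobfuscate_contact("Jane Doe jdoe[AT]example.org")
print(fixed)
```

A string without the marker is returned unchanged, so the helper is safe to run over every dataset file.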

AmitMY · 2024-03-23T08:39:18Z

src/references.bib

+  address = {Cham},
+  doi = {10.1007/978-3-031-19833-5_39},
+  abstract = {Recently, sign language researchers have turned to sign language interpreted TV broadcasts, comprising (i) a video of continuous signing and (ii) subtitles corresponding to the audio content, as a readily available and large-scale source of training data. One key challenge in the usability of such data is the lack of sign annotations. Previous work exploiting such weakly-aligned data only found sparse correspondences between keywords in the subtitle and individual signs. In this work, we propose a simple, scalable framework to vastly increase the density of automatic annotations. Our contributions are the following: (1)~we significantly improve previous annotation methods by making use of synonyms and subtitle-signing alignment; (2)~we show the value of pseudo-labelling from a sign recognition model as a way of sign spotting; (3)~we propose a novel approach for increasing our annotations of known and unknown classes based on in-domain exemplars; (4)~on the BOBSL BSL sign language corpus, we increase the number of confident automatic annotations from 670K to 5M. We make these annotations publicly available to support the sign language research community.},


I think we decided not to include abstracts, to keep this file from getting huge.

Makes sense. Next time I can add that as an "exclude" field in my Better BibTeX plugin for Zotero.
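If abstracts do slip into an export, they could also be stripped after the fact. The following is a minimal sketch, not part of this repository: `strip_abstracts` is a hypothetical helper, and it assumes each `abstract` field starts on its own line and is delimited by braces (it counts nested braces but does not handle escaped ones such as `\{`).

```python
import re

# An abstract field at the start of a line, e.g. "  abstract = {".
_ABSTRACT_START = re.compile(r"^[ \t]*abstract\s*=\s*\{", re.IGNORECASE | re.MULTILINE)


def strip_abstracts(bibtex: str) -> str:
    """Return the BibTeX text with every brace-delimited abstract field removed."""
    kept = []
    pos = 0
    while True:
        match = _ABSTRACT_START.search(bibtex, pos)
        if match is None:
            kept.append(bibtex[pos:])
            break
        kept.append(bibtex[pos:match.start()])
        # Walk forward to the brace that closes the field, tracking nesting.
        depth = 1
        end = match.end()
        while end < len(bibtex) and depth > 0:
            if bibtex[end] == "{":
                depth += 1
            elif bibtex[end] == "}":
                depth -= 1
            end += 1
        # Also drop the trailing comma and newline that belong to the field.
        if end < len(bibtex) and bibtex[end] == ",":
            end += 1
        if end < len(bibtex) and bibtex[end] == "\n":
            end += 1
        pos = end
    return "".join(kept)


sample = (
    "@inproceedings{x,\n"
    "  address = {Cham},\n"
    "  abstract = {We use {BOBSL} data.},\n"
    "  doi = {10.1/abc}\n"
    "}\n"
)
cleaned = strip_abstracts(sample)
print(cleaned)
```

Configuring the exporter to omit the field, as suggested above, is the cleaner option; a post-processing pass like this is only a fallback.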

src/datasets/ISL-HS.json

AmitMY · 2024-03-23T08:40:48Z

src/datasets/BOBSL.json

+      "video:RGB",
+      "text:English"
+    ],
+    "language": "British Sign Language (BSL)",


Should be "British"

AmitMY · 2024-03-23T08:40:58Z

src/datasets/ISL-HS.json

+      "video:RGB",
+      "gloss:ISL-HandShapes"
+    ],
+    "language": "Irish Sign Language (ISL)",


Should be "Irish"
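Both language comments ask for the same change: the `language` field should hold only the bare name. A possible way to normalize such values is sketched below; `short_language_name` is a hypothetical helper, and the rule it applies (drop a trailing "Sign Language" plus an optional parenthesized abbreviation) is inferred from these two review comments rather than from a documented schema.

```python
import re

# Trailing " Sign Language" with an optional abbreviation such as " (BSL)".
_SIGN_LANGUAGE_SUFFIX = re.compile(
    r"\s+Sign Language(\s*\([^)]*\))?\s*$", re.IGNORECASE
)


def short_language_name(language: str) -> str:
    """Reduce e.g. 'British Sign Language (BSL)' to 'British'."""
    return _SIGN_LANGUAGE_SUFFIX.sub("", language).strip()


for value in ("British Sign Language (BSL)", "Irish Sign Language (ISL)", "British"):
    print(value, "->", short_language_name(value))
```

Values that are already in the short form pass through unchanged.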

AmitMY · 2024-03-23T08:43:30Z

src/datasets/ISL-HS.json

+    ],
+    "language": "Irish Sign Language (ISL)",
+    "#items": 23,
+    "#samples": "468 videos available, 58,114 images extracted to show 23 handshapes",


While I'd normally say that more detail is better, please consider how this is displayed in the table.
I think less information should be shown, but if you still want to include the entire text, I'd narrow it to
"468 videos → 58,114 images → 23 handshapes"

Makes sense, it needs to be concise for the table.
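As a small illustration of the suggested format, the concise string can be built from the three counts. The helper name `concise_samples` is hypothetical; the counts are the ones quoted in the review comment.

```python
def concise_samples(videos: int, images: int, handshapes: int) -> str:
    """Format the counts as a short arrow-separated summary for a table cell."""
    parts = [
        f"{videos:,} videos",
        f"{images:,} images",
        f"{handshapes:,} handshapes",
    ]
    return " → ".join(parts)


summary = concise_samples(468, 58114, 23)
print(summary)
```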

cleong110 added 4 commits March 22, 2024 18:36

CDL: add BOBSL

b4461bc

CDL: add SignBD-Word and correct a mistake in BOBSL

38a2ada

CDL: the technical report for BOBSL has a figure for sign vocab

f3087c1

CDL: adding ISL-HS dataset

bc9bc50

AmitMY requested changes Mar 23, 2024

cleong110 added 2 commits March 25, 2024 12:43

CDL: remove abstracts, add SignBD-Word.json

4ba3472

CDL: fix BOBSL.json, ISL-HS.json, SignBD-Word style/content.

e5ae715

AmitMY approved these changes Mar 25, 2024

AmitMY merged commit 49d51b5 into sign-language-processing:master Mar 25, 2024

cleong110 mentioned this pull request May 23, 2024

Look through "Awesome Sign Language", etc and add missing items cleong110/sign-language-processing.github.io#2

Open

34 tasks

This was referenced Jun 7, 2024

YoutubeASL dataset cleong110/sign-language-processing.github.io#12

Open

PopSign ASL dataset cleong110/sign-language-processing.github.io#20

Closed

This was referenced Jun 19, 2024

Dataset/LSA-T #88

Merged

Add VGT Corpus to list of datasets #74

Closed

CSL-Daily dataset cleong110/sign-language-processing.github.io#13

Open

cleong110 mentioned this pull request Jun 26, 2024

Add BSL-1K dataset #71

Closed

13 tasks
