Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

AI & ML interests

None defined yet.

Recent Activity

Chuanyang-Jin authored a paper 1 day ago

Self-Compacting Language Model Agents

mmarone updated a collection 11 days ago

mmBERT: a modern multilingual encoder

TaiMingLu authored a paper 15 days ago

Strong Teacher Not Needed? On Distillation in LLM Pretraining

Papers

DAR: Deontic Reasoning with Agentic Harnesses

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

Collections 3

Spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

Models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17, 2025 • 22k • 76

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7, 2025 • 346k • 217

jhu-clsp/mmBERT-checkpoints

Updated Sep 9, 2025 • 4

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21, 2025 • 21 • 5

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18, 2025 • 310

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18, 2025 • 1.94k • 23

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18, 2025 • 66.6k • 5

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18, 2025 • 4

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18, 2025 • 6.02k • 13

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18, 2025 • 6.95k • 4

Datasets 40

jhu-clsp/ManyIH-Bench

Preview • Updated Apr 13 • 47 • 3

jhu-clsp/robust04-instructions

Viewer • Updated Mar 12 • 136k • 1.53k • 2

jhu-clsp/core17-instructions

Viewer • Updated Mar 12 • 49.4k • 1.64k • 2

jhu-clsp/news21-instructions

Viewer • Updated Mar 12 • 71.5k • 1.43k • 1

jhu-clsp/SciTaRC

Viewer • Updated Mar 6 • 371 • 52 • 1

jhu-clsp/megawika-2

Updated Mar 3 • 100 • 4

jhu-clsp/mmBERT-decay-data

Updated Dec 11, 2025 • 33.1k • 6

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13, 2025 • 2.24k • 1

jhu-clsp/ettin-pretraining-data

Updated Jul 18, 2025 • 129k • 9

jhu-clsp/ettin-decay-data

Updated Jul 18, 2025 • 973 • 1
