Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
36.4
TFLOPS
671
110
482
Tom Aarsen
tomaarsen
Follow
ekrombouts's profile picture
DeepMount00's profile picture
usman-sheikh's profile picture
717 followers
·
117 following
https://linkedin.com/in/tomaarsen
tomaarsen
tomaarsen
tomaarsen
AI & ML interests
NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification
Articles
Welcome Gemma 2 - Google's new open LLM
Jun 27
•
115
Training and Finetuning Embedding Models with Sentence Transformers v3
May 28
•
146
Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon
Apr 3
•
8
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval
Mar 22
•
55
🪆 Introduction to Matryoshka Embedding Models
Feb 23
•
46
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
Dec 6, 2023
•
5
🕳️ Attention Sinks in LLMs for endless fluency
Oct 9, 2023
•
6
Organizations
tomaarsen
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
gowitheflow/supervised-multilingual
about 6 hours ago
Sources
1
#3 opened about 6 hours ago by
tomaarsen
New activity in
jinaai/jina-reranker-v2-base-multilingual
about 7 hours ago
HugginFace text-embeddings-inference (TEI) support
3
#26 opened 1 day ago by
leversberg
New activity in
Alibaba-NLP/gte-Qwen2-1.5B-instruct
1 day ago
Qwen 2.5 1.5B retrain?
2
#12 opened 1 day ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v3
1 day ago
Unable to load the model through sentence_transformerss
8
#22 opened 1 day ago by
adi751
New activity in
Alibaba-NLP/gte-multilingual-base
2 days ago
How to fine-tune?
1
#10 opened 20 days ago by
havardox
Fix broken SentenceTransformer snippet; format code with Python format
#11 opened 2 days ago by
tomaarsen
New activity in
mixedbread-ai/mxbai-rerank-large-v1
8 days ago
Deployment using TEI
3
#7 opened 8 days ago by
WolfAssi285
New activity in
sentence-transformers/embedding-training-data
8 days ago
Issues Loading Dataset
3
#1 opened 11 months ago by
colbertv2
New activity in
sentence-transformers/distiluse-base-multilingual-cased
10 days ago
Language list
5
#2 opened about 2 years ago by
lbourdois
New activity in
mattshumer/Reflection-Llama-3.1-70B
10 days ago
🤔⚔️ David vs Goliath: How Small Models Are Shaking Up the AI Giants Without Billion-Dollar Infrastructures 😬📉
2
#59 opened 10 days ago by
gohelrakesh
New activity in
jinaai/xlm-roberta-flash-implementation
10 days ago
feat: support sentence-transformers
1
#42 opened 10 days ago by
bwang0911
New activity in
zeta-alpha-ai/Zeta-Alpha-E5-Mistral
13 days ago
Integrate with Sentence Transformers (+ third parties like LangChain/Haystack/LlamaIndex, etc.)
1
#1 opened 13 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1
14 days ago
keeping data local
12
#12 opened 7 months ago by
tyler-rankin-opg
New activity in
nvidia/NV-Embed-v2
16 days ago
Improve model metadata
1
#2 opened 17 days ago by
tomaarsen
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
20 days ago
Uninitialised weights warning when loading with Sentence Transformers
4
#4 opened about 1 month ago by
cpierse
Specify add_pooling_layer=False via configuration instead
#5 opened 20 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1.5
21 days ago
Slow inference performance when using nomic-embed-text-v1.5
5
#34 opened 21 days ago by
umesh-c
New activity in
nvidia/NV-Embed-v2
21 days ago
Remove ` (default)` from MTEB metadata
#1 opened 21 days ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v2-base-en
24 days ago
How to finetune the model using multiple GPUs ?
4
#45 opened 24 days ago by
Space192
New activity in
ai-forever/ru-en-RoSBERTa
24 days ago
Adopt MTEB dataset naming scheme
1
#1 opened 24 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1.5
24 days ago
nomic-embed-text running slowly
2
#32 opened about 1 month ago by
xtreme786
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
28 days ago
Languages?
5
#2 opened about 2 months ago by
Mazyod
New activity in
sentence-transformers/all-MiniLM-L6-v2
about 1 month ago
Similarity Search Type
1
#77 opened about 1 month ago by
frbackup
New activity in
answerdotai/answerai-colbert-small-v1
about 1 month ago
Fix a snippet
1
#1 opened about 1 month ago by
tomaarsen
New activity in
sentence-transformers/all-MiniLM-L6-v2
about 1 month ago
Trying to run this model locally, on my machine
4
#75 opened about 1 month ago by
abdulrafay97
New activity in
mixedbread-ai/deepset-mxbai-embed-de-large-v1
about 2 months ago
fix: config.json
6
#4 opened about 2 months ago by
ouz-m
New activity in
sarkii/MizoEmbed-1
about 2 months ago
Set language to "lus" for Lushai
#2 opened about 2 months ago by
tomaarsen
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
about 2 months ago
Remove ` (default)` from MTEB scores caused by an MTEB bug
#3 opened about 2 months ago by
tomaarsen
New activity in
BAAI/bge-multilingual-gemma2
about 2 months ago
Integrate with Sentence Transformers (+ third parties like LangChain/Haystack/LlamaIndex, etc.)
1
#1 opened about 2 months ago by
tomaarsen
New activity in
mixedbread-ai/deepset-mxbai-embed-de-large-v1
about 2 months ago
Fix a small typo
1
#3 opened about 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
about 2 months ago
Getting different results for the same examples provided in sample
4
#17 opened about 2 months ago by
sramakintel
New activity in
dunzhang/stella_en_400M_v5
about 2 months ago
Fix query_prompt_name variable name
#9 opened about 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
about 2 months ago
Fix query_prompt_name variable name
#15 opened about 2 months ago by
tomaarsen
Error when loading model KeyError: 'qwen2'
1
#11 opened about 2 months ago by
longluu
New activity in
jinaai/jina-reranker-v2-base-multilingual
about 2 months ago
data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 69 column 3
3
#17 opened 2 months ago by
sigridjineth
New activity in
Snowflake/snowflake-arctic-embed-m
about 2 months ago
Sentence Transformers integration
3
#2 opened 5 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
2 months ago
Model max_seq_length
6
#6 opened 2 months ago by
shuyuej
New activity in
sentence-transformers/all-MiniLM-L6-v2
2 months ago
Does the model take our data and use our data to improve itself or for some other purpose?
2
#70 opened 2 months ago by
AIProDK
New activity in
Snowflake/snowflake-arctic-embed-m
2 months ago
TypeError: Pooling.__init__() got an unexpected keyword argument 'include_prompt'
1
#13 opened 2 months ago by
Rageshhf
New activity in
dunzhang/stella_en_1.5B_v5
2 months ago
Set 1024 as default dim, update usage snippets, store prompts in config
#1 opened 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_400M_v5
2 months ago
Set 1024 as default dim, update usage snippets, store prompts in config
#1 opened 2 months ago by
tomaarsen
New activity in
mteb/leaderboard
2 months ago
New Embedding Model for MTEB - Retriever/BIER Benchmark - Applying for refresh
11
#134 opened 2 months ago by
nv-bschifferer
e5-R-mistral-7b for retrieval, apply for refreshing the results
16
#132 opened 2 months ago by
BeastyZ
New activity in
nvidia/NV-Retriever-v1
2 months ago
Remove ` (default)` that has been included due to a bug in MTEB
#1 opened 2 months ago by
tomaarsen
New activity in
SetFit/bbc-news
3 months ago
Update README.md to include correct attribution and source
1
#1 opened 3 months ago by
derekgreene
New activity in
tomaarsen/mining_demo
3 months ago
Librarian Bot: Add language metadata for dataset
#2 opened 3 months ago by
librarian-bot
New activity in
jinaai/jina-clip-v1
3 months ago
How to create embeddings in the browser?
2
#16 opened 3 months ago by
gnoel-ddh
New activity in
mteb/leaderboard
3 months ago
New model and CMTEB leaderboard refresh request
1
#130 opened 3 months ago by
lier007
New activity in
lier007/xiaobu-embedding-v2
3 months ago
Set `library_name` as Sentence Transformers
#1 opened 3 months ago by
tomaarsen
New activity in
Lajavaness/bilingual-embedding-large
3 months ago
Questions about model & architecture
5
#1 opened 3 months ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v2-base-en
3 months ago
Saving and Loading the fine-tuned model
11
#24 opened 11 months ago by
maiia-bocharova
New activity in
gbyuvd/ChemEmbed-v01
3 months ago
Fascinating work!
3
#1 opened 3 months ago by
tomaarsen
New activity in
jinaai/jina-clip-v1
3 months ago
Could not locate the jinaai/jina-clip-implementation--configuration_clip.py inside jinaai/jina-clip-v1.
6
#15 opened 3 months ago by
SergeShirokov
New activity in
sentence-transformers/msmarco-msmarco-distilbert-base-tas-b
3 months ago
What is the best performant "dataset" in MS MARCO Mined Triplets?
2
#1 opened 3 months ago by
kyeongpil
New activity in
mteb/leaderboard
3 months ago
New Embedding Models | Apply for refershing the results
6
#128 opened 3 months ago by
Omartificial-Intelligence-Space
New activity in
Omartificial-Intelligence-Space/Arabic-NLi-Triplet
3 months ago
Add "sentence-transformers" tag
1
#2 opened 3 months ago by
tomaarsen
New activity in
mteb/leaderboard
3 months ago
New model and mteb leaderboard refresh request
2
#129 opened 3 months ago by
lvkaokao
New activity in
jinaai/jina-reranker-v2-base-multilingual
3 months ago
Describe CrossEncoder integration with Sentence Transformers
3
#8 opened 3 months ago by
tomaarsen
New activity in
Omartificial-Intelligence-Space/Arabic-MiniLM-L12-v2-all-nli-triplet
3 months ago
Add Arabic language tag for easier discoverability
3
#1 opened 3 months ago by
tomaarsen
New activity in
jinaai/jina-reranker-v2-base-multilingual
3 months ago
fix: sagemaker import issue
2
#6 opened 3 months ago by
jemfu
Load more