The Mamba in the Llama: Distilling and Accelerating Hybrid Models • arXiv:2408.15237 • Published Aug 2024
KTO: Model Alignment as Prospect Theoretic Optimization • arXiv:2402.01306 • Published Feb 2, 2024
Planning In Natural Language Improves LLM Search For Code Generation • arXiv:2409.03733 • Published Sep 2024
FocusLLM: Scaling LLM's Context by Parallel Decoding • arXiv:2408.11745 • Published Aug 2024
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale • arXiv:2408.12570 • Published Aug 2024
LLM Pruning and Distillation in Practice: The Minitron Approach • arXiv:2408.11796 • Published Aug 2024
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study • arXiv:2404.10719 • Published Apr 16, 2024
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning • arXiv:2405.12130 • Published May 20, 2024
S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs • arXiv:2405.20314 • Published May 30, 2024
Contextual Position Encoding: Learning to Count What's Important • arXiv:2405.18719 • Published May 29, 2024