Ougrid-D (Ougrid Dumdang)

upvoted an article 2 days ago

Article

Illustrated LLM OS: An Implementational Perspective

Dec 3, 2023

• 11

upvoted an article 17 days ago

Article

Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning

Feb 20

• 11

upvoted 2 articles about 1 month ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19

• 72

Article

Perspectives for first principles prompt engineering

Aug 18

• 16

upvoted a paper about 1 month ago

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Paper • 2408.08274 • Published Aug 15 • 11

upvoted 2 articles about 2 months ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

• 56

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 79

upvoted a paper about 2 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 102

upvoted 2 articles about 2 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 157

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 26

upvoted a paper about 2 months ago

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25 • 30

upvoted 3 articles about 2 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 50

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29

• 193

Article

LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?

Jul 25

• 18

upvoted a paper about 2 months ago

EVLM: An Efficient Vision-Language Model for Visual Understanding

Paper • 2407.14177 • Published Jul 19 • 42

upvoted a paper 2 months ago

E5-V: Universal Embeddings with Multimodal Large Language Models

Paper • 2407.12580 • Published Jul 17 • 38

upvoted a paper 3 months ago

Wavelets Are All You Need for Autoregressive Image Generation

Paper • 2406.19997 • Published Jun 28 • 28

upvoted 3 articles 3 months ago

Article

Vision Language Models Explained

Apr 11

• 176

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 115

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 166

upvoted 2 papers 3 months ago

VoCo-LLaMA: Towards Vision Compression with Large Language Models

Paper • 2406.12275 • Published Jun 18 • 29

Estimating the Hallucination Rate of Generative AI

Paper • 2406.07457 • Published Jun 11 • 6

upvoted an article 3 months ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Jun 20

• 26

upvoted a collection 4 months ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 1 day ago • 332

upvoted a paper 4 months ago

To Believe or Not to Believe Your LLM

Paper • 2406.02543 • Published Jun 4 • 31

upvoted 2 articles 4 months ago

Article

Uncensor any LLM with abliteration

Jun 13

• 312

Article

Everything About Long Context Fine-tuning

May 10

• 26

upvoted a paper 5 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

upvoted 2 articles 5 months ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 52

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 272

upvoted a paper 5 months ago

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11 • 30

upvoted an article 5 months ago

Article

CodeGemma - an official Google release for code LLMs

Apr 9

• 99

upvoted 2 papers 6 months ago

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Paper • 2403.19651 • Published Mar 28 • 23

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 64

upvoted 4 papers 7 months ago

upvoted 6 papers 8 months ago

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29 • 48

MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 44

Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11 • 24

LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4 • 36

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8 • 20

DocGraphLM: Documental Graph Language Model for Information Extraction

Paper • 2401.02823 • Published Jan 5 • 34

upvoted 9 papers 9 months ago

Denoising Vision Transformers

Paper • 2401.02957 • Published Jan 5 • 27

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4 • 61

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1 • 21

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1 • 14

Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models

Paper • 2312.17661 • Published Dec 29, 2023 • 13

Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

Paper • 2312.12487 • Published Dec 19, 2023 • 9

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 27

SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Paper • 2312.11392 • Published Dec 18, 2023 • 19

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Paper • 2312.09251 • Published Dec 14, 2023 • 6
