Heegyu Kim PRO

heegyu

https://github.com/HeegyuKim

HeegyuKim

AI & ML interests

NLP

heegyu's activity

upvoted a collection 16 days ago

3D

Stability AI's suite of models for 3D generation • 5 items • Updated Aug 9 • 24

upvoted 2 collections 28 days ago

4bit Instruct Models

15 items • Updated 15 days ago • 23

DeepSeek-Prover

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16 • 13

upvoted a collection 2 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated 5 days ago • 10

upvoted a collection 3 months ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 64

upvoted a collection 4 months ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 19

upvoted a paper 4 months ago

DoRA: Weight-Decomposed Low-Rank Adaptation

Paper • 2402.09353 • Published Feb 14 • 24

upvoted a collection 5 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Apr 15 • 23

upvoted 2 papers 6 months ago

Shepherd: A Critic for Language Model Generation

Paper • 2308.04592 • Published Aug 8, 2023 • 29

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15 • 56

upvoted a collection 6 months ago

zephyr-7b-sft-full-SPIN

Models fine-tuned with SPIN across iterations 0,1,2,3 • 4 items • Updated Feb 7 • 8

upvoted a paper 7 months ago

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Paper • 2402.06619 • Published Feb 9 • 52

upvoted a collection 7 months ago

Multilingual

128 items • Updated Jul 24 • 10

upvoted 2 papers 7 months ago

xCoT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning

Paper • 2401.07037 • Published Jan 13 • 2

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

upvoted a collection 7 months ago

OpenCodeInterpreter

18 items • Updated Mar 3 • 81

upvoted a collection 8 months ago

Tiny Series

Tiny datasets that empower the foundation of Small Language Model! • 11 items • Updated Jan 26 • 34

upvoted a collection 10 months ago

Korean Datasets I've released so far.

A collection of the Korean datasets I have uploaded so far. • 8 items • Updated May 24 • 16