Jeonghwan Park PRO

maywell

https://www.linkedin.com/in/jeonghwan-park-6b97b1245

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

nvidia/Kimi-K2.6-NVFP4

liked a model 4 days ago

dealignai/Gemma-4-31B-JANG_4M-CRACK

liked a model 12 days ago

google/gemma-4-31B-it-assistant

View all activity

Organizations

upvoted an article 5 months ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

sionic-ai

•

Dec 8, 2025

• 57

upvoted 2 papers 6 months ago

Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel

Paper • 2508.18224 • Published Aug 25, 2025 • 1

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Paper • 2511.09611 • Published Nov 12, 2025 • 71

upvoted a paper 7 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 87

upvoted 2 articles 8 months ago

Article

Vocabulary is the most important element of Sparse Retrieval

yjoonjang

•

Oct 4, 2025

• 10

Article

Training and Finetuning Reranker Models with Sentence Transformers

tomaarsen

•

Mar 26, 2025

• 194

upvoted an article 9 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

kuotient

•

Aug 9, 2025

• 57

upvoted a paper about 1 year ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

upvoted 3 papers over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 379

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 38

upvoted an article over 1 year ago

Article

Navigating Korean LLM Research #1: Models

amphora

•

Oct 22, 2024

• 26

upvoted a paper over 1 year ago

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14, 2024 • 51

upvoted a collection over 1 year ago

Gemma-APS Release

Collection

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated Mar 12 • 26

upvoted 2 articles over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf

•

Sep 18, 2024

• 280

Article

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

chansung

•

Aug 22, 2024

• 13

upvoted an article almost 2 years ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 274

upvoted a paper almost 2 years ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 83

upvoted an article almost 2 years ago

Article

Putting RL back in RLHF

vwxyzjn, ArashAhmadian

•

Jun 12, 2024

• 111

upvoted a paper almost 2 years ago

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 34

Jeonghwan Park PRO

AI & ML interests

Recent Activity

Organizations

maywell's activity

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Vocabulary is the most important element of Sparse Retrieval

Training and Finetuning Reranker Models with Sentence Transformers

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Navigating Korean LLM Research #1: Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified

Training and Finetuning Embedding Models with Sentence Transformers

Putting RL back in RLHF