Skip to content
View lovit's full-sized avatar
🧩
Focusing
🧩
Focusing

Highlights

  • Pro

Organizations

@ko-nlp

Block or report lovit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,883 8,005 Updated Jan 13, 2026

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

Python 48 10 Updated Mar 2, 2024

🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드

Python 59 8 Updated May 23, 2023

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 159,593 14,170 Updated Jan 16, 2026

An open-source NLP research library, built on PyTorch.

Python 11,889 2,236 Updated Nov 22, 2022
Python 1,504 113 Updated May 12, 2023

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

484 42 Updated Aug 22, 2023
Jupyter Notebook 1,474 202 Updated Sep 16, 2022

Best Practices on Recommendation Systems

Python 21,354 3,280 Updated Jan 16, 2026

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Python 786 197 Updated May 19, 2024

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

Python 1,015 139 Updated Jan 30, 2024

Utilities for parsing Wikipedia MySQL/MariaDB dumps.

Python 12 4 Updated Mar 6, 2023

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Python 895 89 Updated Aug 20, 2024

"A survey of Transformer" paper study 👩🏻‍💻🧑🏻‍💻 KoreaUniv. DSBA Lab

185 19 Updated Nov 4, 2021

Parse strings using a specification based on the Python format() syntax.

Python 1,778 106 Updated Dec 3, 2025

Scikit-learn compatible implementations of the Random Rotation Ensemble idea of (Blaser & Fryzlewicz, 2016)

Python 43 6 Updated Mar 21, 2016

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

2,647 336 Updated May 30, 2023

A library to detect what alphabet something is written in.

Python 153 14 Updated May 20, 2017

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2,440 443 Updated Aug 9, 2024

Solves basic Russian NLP tasks, API for lower level Natasha projects

Python 1,303 111 Updated Oct 17, 2024

Juman++ (a Morphological Analyzer Toolkit)

C++ 406 46 Updated Oct 3, 2023
Jupyter Notebook 1 Updated Jan 23, 2021

Paper List for Style Transfer in Text

1,624 194 Updated Mar 16, 2023

Automatically visualize your pandas dataframe via a single print! 📊 💡

Python 5,365 377 Updated Mar 20, 2024

Some useful tips for faiss

Shell 629 48 Updated Sep 1, 2025

Pretrained ELECTRA Model for Korean

Python 628 136 Updated Feb 19, 2024

TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast even with large datasets

Jupyter Notebook 581 69 Updated Jun 27, 2023

Jejueo Datasets for Machine Translation and Speech Synthesis

Python 83 11 Updated Feb 19, 2020

The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)

Python 119 10 Updated Oct 8, 2020

깔끔한 파이썬 탄탄한 백엔드 소스코드 정리

Python 43 3 Updated Nov 4, 2022
Next