ARC_SNU

All

81 repositories

Libra
Public
[ICLR 2026] Libra: Effective yet Efficient Load Balancing for Large-Scale MoE Inference
0•0•0•0•Updated Mar 3, 2026Mar 3, 2026
Libra-Internal
Public
Apache License 2.0
•0•0•0•0•Updated Feb 28, 2026Feb 28, 2026
Libra-Core
Public
Python
•
Apache License 2.0
•0•0•0•0•Updated Feb 28, 2026Feb 28, 2026
GS-Scale
Public
[ASPLOS '26] Fast, memory efficient, and scalable 3D Gaussian Splatting training framework
Cuda
•
MIT License
•1•16•0•0•Updated Feb 9, 2026Feb 9, 2026
DecDEC
Public
[OSDI 2025] DecDEC: A Systems Approach to Advancing Low‑Bit LLM Quantization
Python
•3•22•0•0•Updated Jan 29, 2026Jan 29, 2026
flashTP
Public
Torch-native C++/CUDA library to accelerate tensor-product layers in MLIPs
Cuda
•
MIT License
•4•55•1•0•Updated Nov 26, 2025Nov 26, 2025
NestedFP
Public
[NeurIPS 2025] NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs
HTML
•0•7•0•0•Updated Nov 21, 2025Nov 21, 2025
DP-LLM
Public
[NeurIPS 2025] DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment
Python
•
MIT License
•7•7•0•0•Updated Oct 24, 2025Oct 24, 2025
DP-LLM_pre_finetuned
Public
Pre-finetuned results for DP-LLM.
MIT License
•0•0•0•0•Updated Oct 23, 2025Oct 23, 2025
FastPoint
Public
[ICCV 2025] FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction
Python
•1•19•0•0•Updated Sep 18, 2025Sep 18, 2025
any-precision-llm
Public
[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
Python
•
MIT License
•7•122•2•0•Updated Jul 4, 2025Jul 4, 2025
ADA-NNS
Public
Python
•0•5•0•0•Updated Apr 4, 2025Apr 4, 2025
DRAM_FAULT_SIM
Public
C++
•0•2•0•0•Updated Feb 25, 2025Feb 25, 2025
gem5
Public
forked from https://github.com/gem5/gem5
C++
•
BSD 3-Clause "New" or "Revised" License
•1.7k•0•0•0•Updated Jan 14, 2025Jan 14, 2025
Ginex
Public
Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching
Python
•8•41•2•1•Updated Jul 10, 2024Jul 10, 2024
Frugal_PN_Training
Public
[ECCV 2024] Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused Aggregation
Python
•0•4•0•0•Updated Jul 4, 2024Jul 4, 2024
2024_spring_sysprog_Lab5
Public
C
•0•8•12•0•Updated Jun 16, 2024Jun 16, 2024
2024_spring_sysprog_Lab4
Public
C
•0•8•23•0•Updated May 27, 2024May 27, 2024
2024_spring_sysprog_Lab3
Public
C
•0•7•38•0•Updated Apr 28, 2024Apr 28, 2024
gem-forge-gem5
Public
C++
•
BSD 3-Clause "New" or "Revised" License
•7•0•0•0•Updated Apr 22, 2024Apr 22, 2024
2024_spring_sysprog_Lab2
Public
C
•3•6•30•0•Updated Apr 1, 2024Apr 1, 2024
gem-forge-transform
Public
C
•3•0•0•0•Updated Mar 9, 2024Mar 9, 2024
2024_spring_sysprog_Lab1
Public
C
•2•3•2•0•Updated Mar 7, 2024Mar 7, 2024
KVRouter
Public
C++
•0•0•0•0•Updated Mar 1, 2024Mar 1, 2024
gem-forge-framework
Public
Makefile
•
BSD 2-Clause "Simplified" License
•10•0•0•0•Updated Feb 24, 2024Feb 24, 2024
ActviationNMSParsity_gpgpusim
Public
C++
•
Other
•0•0•0•0•Updated Feb 7, 2024Feb 7, 2024
ActivationNMSparisty
Public
C++
•
Other
•0•0•0•0•Updated Feb 7, 2024Feb 7, 2024
gem-forge-llvm
Public
C++
•3•0•0•0•Updated Oct 19, 2023Oct 19, 2023
FusedMM
Public
Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"
C
•5•0•0•0•Updated Oct 5, 2023Oct 5, 2023
NotAllNeighborsMatter
Public
Python
•0•2•0•0•Updated Jun 27, 2023Jun 27, 2023