hckrnws

Lossless LLM compression for efficient GPU inference via dynamic-length float

(arxiv.org)
356
22
18h

by CharlesW

jhj
15h
iandanforth
14h
VladVladikoff
12h
bjornsing
7h
ironbound
5h
vessenes
12h
zorgmonkey
11h
refibrillator
9h
liuliu
7h
brookst
8h
hinkley
11h
boulos
8h
badmonster
17h
latchkey
15h
airstrike
14h
latchkey
14h
sundarurfriend
12h
latchkey
12h
Ringz
11h
latchkey
11h
saagarjha
11h
latchkey
11h
zarathustreal
41m
miohtama
15h
daveguy
15h
mhitza
14h
gunalx
13h
Der_Einzige
10h
LoganDark
9h
Der_Einzige
10h
danielmarkbruce
16h
jhj
13h
striking
16h
kadushka
16h
latchkey
15h
NBJack
14h
latchkey
14h
DrillShopper
13h
latchkey
12h
danielmarkbruce
16h
spoaceman7777
15h
danielmarkbruce
15h
loufe
17h
jonplackett
17h
loufe
7h
Animats
15h
eoerl
13h
aseligman
14h
yjftsjthsd-h
16h
brigade
6h
philjohn
16h
hnuser123456
15h
gitroom
8h
thund
13h
thund
13h
jhj
13h
wills_forward
17h
moffkalast
17h
janalsncm
17h
danielmarkbruce
17h
moffkalast
16h
Der_Einzige
10h
moffkalast
5h
BoorishBears
16h
imtringued
3h
danielmarkbruce
16h
BoorishBears
15h
danielmarkbruce
15h
BoorishBears
14h
danielmarkbruce
14h
kridsdale3
16h
danielmarkbruce
16h
kadushka
16h
omneity
15h
throwaway314155
16h
firefoxd
9h
jsemrau
11h
xmasotto
10h
buildbot
6h
mountainriver
17h
luotuoshangdui
16h
aazo11
14h
marksimi
16h
iamnotagenius
17h
sroussey
17h
spindump8930
16h
jasonjmcghee
14h
gojomo
17h
svachalek
17h
iamnotagenius
15h
newuser111
11h
fxegdfvbfds
12h
ein0p
17h
timschmidt
17h
ein0p
16h
timschmidt
14h
ein0p
14h
ow5
16h
ein0p
15h
hchja
16h
spindump8930
16h
throwaway314155
16h
anticensor
15h
vessenes
12h
Havoc
17h
artemisart
17h
Vendan
17h
brokencode
17h
vintermann
17h
8ytecoder
17h
ziddoap
17h

Crafted by Rajat

Source Code