Tom Turney
@TheTomWorking on LLM inference systems, KV cache compression, and kernel-level optimizations (TurboQuant).
Language Breakdown
Lines of code distribution across 21 owned repositories
T-Shaped Developer
T-shapedDeep in Swift with broad versatility
Collaboration Network
Global Impact visualization
Repos
41
PRs
0
Growth
+18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Top Repositories
LLM inference in C/C++
vLLM Metal plugin powered by mlx-swift — high-performance LLM inference on Apple Silicon
I'm crazy and trying to make a ForScan OBD reader work on my mac.
Open long-context inference stack: retrieval + open weights, no closed parts. pip install longctx.
A high-throughput and memory-efficient inference and serving engine for LLMs
Deterministic State Recovery for AI Coding Agents
Driverless NVIDIA Pascal (GTX 1060) compute from macOS Apple Silicon over Thunderbolt eGPU
MLX: An array framework for Apple silicon
Add ranged link support to Obsidian
Open Source Impact
Contributions to external projects