hi nerds, this is my website so my domain isn't sitting idle
edit: im going to TRY to keep this updated with regular posts. Do not expect me to be consistent :D
feel free to reach out via email or connect with me on Twitter.Exploring a counterintuitive training strategy—drop tokens that seem less important during training to encourage better model generalization...
Investigating persistent memory in transformers: balancing low compression loss, recall of distant tokens, and runtime speed optimization...
Most people don't actually want freedom. They want the illusion of choice within a set of pre-approved options...
Consciousness is a side effect. You think you are you, but that's just the high-level abstraction...