
Mostafa Elhoushi
Research Engineer @ Meta
early exit
large language model
speculative decoding
1
presentations
Presentations

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Mostafa Elhoushi and 12 other authors
Research Engineer @ Meta
early exit
large language model
speculative decoding
presentations
Mostafa Elhoushi and 12 other authors