Welcome to FlexModel’s documentation!
FlexModel is a wrapper for PyTorch models that exposes powerful primitives for model surgery and introspection.
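To make "introspection" and "surgery" concrete, here is a minimal sketch that uses only PyTorch's built-in forward hooks. It does not use FlexModel's API, and the toy model and the names `save_activation` and `zero_activation` are assumptions made for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy model standing in for a much larger network (illustrative only).
model = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))

captured = {}

def save_activation(module, inputs, output):
    # Introspection: keep a detached copy of the layer's output.
    captured["relu"] = output.detach().clone()

def zero_activation(module, inputs, output):
    # Surgery: a non-None return value from a forward hook replaces the output.
    return torch.zeros_like(output)

x = torch.randn(3, 4)

# Pass 1: read the activation without changing the computation.
handle = model[1].register_forward_hook(save_activation)
baseline = model(x)
handle.remove()

# Pass 2: overwrite the activation and observe the effect downstream.
handle = model[1].register_forward_hook(zero_activation)
edited = model(x)
handle.remove()

print(captured["relu"].shape)
print((baseline - edited).abs().max())
```

Plain hooks like these operate on whatever tensor the local process holds. When a model is sharded across several GPUs, each rank typically sees only part of an activation, so hooks alone do not give a view of the full tensor.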
Getting Started
Check out our examples for single- and multi-GPU setups, which use both Megatron-style layers and PyTorch’s FSDP wrapper. We also provide a demo that includes code for identifying induction heads in Llama-2-70b.