FlashInfer is a library and kernel generator for Large Language Models that provides high-performance implementations of LLM GPU kernels such as FlashAttention, SparseAttention, PageAttention, Sampling ...
ghidriff provides a command-line binary diffing capability with a fresh take on the diffing workflow and its results. It leverages the power of Ghidra's ProgramAPI and FlatProgramAPI to find the added, ...