Music |
Video |
Movies |
Chart |
Show |
HPCA' SpAtten: Efficient Sparse Attention Architecture w/ Cascade Token/Head Pruning by Hanrui Wang (Hanrui Wang) View | |
Short Intro HPCA'21 SpAtten: Efficient Sparse Attention Architecture with Cascade Token/Head Pruning (MIT HAN Lab) View | |
HPCA'22: QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits Hanrui Wang (MIT HAN Lab) View | |
[HPCA'22 ]ANNA: Specialized Architecture for Approximate Nearest Neighbor Search (ARC SNU) View | |
HPCA Conference presentation (Corne Meintjes) View | |
[HPCA'20] Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations (Cornell Zhang Research Group) View | |
Is Sparse Attention more Interpretable (Clara Meister) View | |
ParaDox: Eliminating Voltage Margins via Heterogeneous Fault Tolerance (Long Talk, HPCA 2021) (Sam Ainsworth) View | |
[HPCA'21] Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework (CoCoPIE Real-time AI on Mobile) View | |
Big Bird: Transformers for Longer Sequences (The NLP Lab) View |