时间 | 主讲人 | 论文题目 | 附件 | 会议/期刊 |
2023.1.5 | 黄庄湫 | ValueExpert: Exploring Value Patterns in GPU-Accelerated Applications |
| ASPLOS 2022 |
2022.12.15 | 万嘉诚 | 3DSSD: Point-based 3D Single Stage Object Detector | 万嘉诚-12.15.pptx | CVPR 2020 |
2022.12.8 | 孙新雨 | Ansor: Generating High-Performance Tensor Programs for Deep Learning | 2022.12.8论文组会精读_孙新雨.pptx | OSDI 2020 |
2022.12.1 | 龚磊 | Image Amodal Completion: A Survey | ImageAmodalCompletion (2).pptx | Arxiv 2022 |
2022.11.24 | 胡明哲 | Safe and Principled Language Interoperation
| trifonov1999safemlc.pptx | TRIFONOV 1999 |
2022.11.17 | 赖民信 | DISC : A Dynamic Shape Compiler for Machine Learning Workloads | 阅读组会221117-zhu2021disc.pptx | EuroMLSys 2021
|
2022.11.10 | 翟祎 | Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning | Alpa.pptx | Arxiv 2022 |
2022.11.3 | 赵琦 | Fine-Granular Computation and Data Layout Reorganization for Improving Locality | iccad'22-Fine-Granular Computation and Data Layout Reorganization for Improving Locality.pptx | ICCAD 2020 |
2022.10.27 | 黄奕桐 | AsyMo: Scalable and Efficient Deep-Learning Inference on Asymmetric Mobile CPUs | 2022-10-27-AsyMo.pptx | MOBICOM 2022 |
2022.10.20 | 刘硕 | Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization | Compiler-Aware.pptx | ArXiv 2020 |
2022.10.13 | 丁伯尧 | Detecting Blocking Errors in Go Programs using Localized Abstract Interpretation | Detecting Blocking Errors in Go Programs(1).pptx | ACM ISBN |
2022.9.28 | 王顺洪 | Automatic Horizontal Fusion for GPU Kernels | Automatic_Horizontal_Fusion_for_GPU_Kernels.pptx | ISCGO 2022 |
2022.9.21 | 陈铭瑜 | Analytical Modeling Is Enough for High-Performance BLIS | toms2016blis.pptx | TOMS 2016 |
2022.9.14 | 龚磊 | Mask R-CNN | MaskRCNN.pptx |
|
2022.9.7 | 胡明哲 | Operational Semantics for Multi-Language Programs | matthews2007boundary.pptx | POPL 2007 |
2022.8.31 | 李永尚 | 2QAN: a quantum compiler for 2-local qubit hamiltonian simulation algorithms | 2QAN.pptx | ASPLOS2022 |
2022.8.4 | 黄奕桐 | Whale: Efficient Giant Model Training over Heterogeneous GPUs | ATC2022-Whale.pdf 2022-08-04-Whale.pptx |
|
2022.7.28 | 龚磊 | SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation | CVPR2022SHIFT.pdf CVPR22SHIFT.pptx | CVPR2022 |
2022.7.21 | 胡明哲 | Strictly Declarative Specification of Sophisticated Points-to Analyses | Strictly Declarative Specification of Sophisticated Points-to Analyses.pdf doop-based-pointer-analyses.pptx
|
|
2022.7.14 | 陈铭瑜 | Progressive Raising in Multi-level IR | 2021cgo-TacticsDSL.pdf cgo2018-Look-Ahead SLP.pptx |
|
2022.6.30 | 万嘉诚 | Transformers in 3D Point Clouds: A Survey
| Transformers.pdf 万嘉诚-6.30.pptx |
|
2022.6.23 | 谭泽霖 | TVM: An Automated End-to-End Optimizing Compiler for Deep Learning | TVM.pdf |
|
2022.6.16
| 陈金宝 | Uncovering the Hidden Dangers: Finding Unsafe Go Code in the Wild | Uncovering_the_Hidden_Dangers_Finding_Unsafe_Go_Code_in_the_Wild.pdf Uncovering_the_Hidden_Dangers_Finding_Unsafe_Go_Code_in_the_Wild.pptx | TrustCom
|
2022.6.9
| 黄庄湫 | Efficient Transformers A Survey.pdf | Efficient Transformers A Survey.pptx |
|
2022.6.2 | 万嘉诚 | TRT-ViT: TensorRT-oriented Vision Transformer2205.09579v1.pdf | 万嘉诚-6.2 .pptx |
|
2022.5.26 | 谭泽霖
| AI 编译器概览、挑战和实践
| AI编译器的概览、挑战和实践 [自动保存的].pptx |
|
2022.5.19 | 赵琦 | MetaCG:Annotated Call-Graphs to Facilitate Whole-Program Analysis.pdf
| MetaCG.pptx | TAPAS20 |
2022.5.12 | 王顺洪 | Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations.pdf
| Enabli...mations(1).pptx | ICS2015
|
2022.5.5 | 孙新雨 | Hinterstoisser_On_Pre-Trained_Image_Features_and_Synthetic_Images_for_Deep_Learning_ECCVW_2018_paper.pdf
| 2022.5.5论文组会精读_孙新雨.pptx | ECCV2018 |
2022.4.28 | 刘硕 | Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | bmm-swinT.pptx | ICCV2021
|
2022.4.21 | 丁伯尧
| A Study and Toolkit for Asynchronous Programming in C# | icse14.pptx | ICSE14 |
2022.4.14 | 陈铭瑜 | T2S-Tensor: Productively Generating High-Performance Spatial Hardware for DenseTensor Computations | tccm2019t2s.pptx | FCCM2019 |
2022.4.7 | 翟祎
| Experimental Evaluation of Energy Behavior of Iteration Space Tiling | LCPC2000.pptx | LCPC |
2022.3.31 | 赖民信 | Look-Ahead SLP: Auto-vectorization in the Presenceof Commutative Operations | cgo2018-Look-Ahead SLP.pptx | CGO2018 |
2022.3.24 | 李永尚 | Suppressing ZZ Crosstalk of Quantum Computers through Pulse and Scheduling Co-Optimization | zz crosstalk.pptx |
|
2022.3.17 | 黄奕桐 | Zico: Efficient GPU Memory Sharing for Concurrent DNN Training | Zico.pptx | ATC2021 |
2022.3.10 | 龚磊 | On a Formal Model of Safe and Scalable Self-driving Cars | Mobileye_RSS.pptx |
|
2022.3.3 | 胡明哲 | 博士开题报告 | 开题答辩-胡明哲.pptx |
|
2022.2.24 | 陈铭瑜 | Partial Compilation of Variational Algorithms for Noisy Intermediate-Scale Quantum Machines | micro2019partial.pptx | MICRO2019 |
2022.1.12 | 李永尚 | A Monte Carlo Tree Search Framework for Quantum Circuit Transformation | MCTS.pptx | ICCAD 2020 |
2022.1.5 | 赵琦 | Modular Call Graph Construction for Security Scanning of Node.js Applications | jam.pptx | ISSTA 2021 |