Skip to content
View hazdzz's full-sized avatar
  • National Central University
  • Taiwan

Highlights

  • Pro

Block or report hazdzz

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"

Python 147 9 Updated Jun 10, 2019

A blazing fast, information dense media player built with Next.js.

TypeScript 184 6 Updated Sep 17, 2024

Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel

Python 247 13 Updated Sep 20, 2024

Mask Attention Networks: Rethinking and Strengthen Transformer in NAACL2021

Python 15 5 Updated Jun 3, 2021

An automatic compilation script for 7-Zip, which replaces the default file association icons and file manager skins with more attractive ones, and adds associations for Jar and War files.

PowerShell 103 22 Updated Aug 25, 2024

小狼毫输入法配置方案整合包,整合了雾凇拼音和空山五笔方案,做到了配置方案的一键安装,为初入中州韵输入法的小白提供便利。

Lua 25 Updated Jul 25, 2024

Writing AI Conference Papers: A Handbook for Beginners

801 20 Updated Sep 19, 2024

Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"

Python 52 2 Updated Sep 9, 2024

Using FlexAttention to compute attention with different masking patterns

Python 28 Updated Sep 10, 2024

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python 4,632 146 Updated Sep 11, 2024

一款内网综合扫描工具,方便一键自动化、全方位漏扫扫描。

Go 10,264 1,565 Updated Aug 29, 2024

A Pytorch implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Python 41 8 Updated Jul 31, 2022

The AdEMAMix Optimizer: Better, Faster, Older.

Python 141 8 Updated Sep 12, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,757 1,969 Updated Apr 16, 2024

Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.

Jupyter Notebook 97 10 Updated Jun 10, 2021

About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf

Python 301 28 Updated Jul 18, 2024

Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.

Python 121 32 Updated Apr 24, 2023

A download tools for clawing the ebooks from internets.

Go 859 56 Updated Sep 5, 2024

Use your Neovim like using Cursor AI IDE!

Lua 5,262 175 Updated Sep 19, 2024

The Fully Customizable Desktop Environment for Windows 10/11 with a windows tiling manager included.

Rust 1,375 34 Updated Sep 19, 2024

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

Python 1,736 247 Updated Jul 27, 2024

jiant is an nlp toolkit

Python 1,637 297 Updated Jul 6, 2023

[DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations

Python 758 166 Updated Aug 3, 2021

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,206 64 Updated Sep 20, 2024

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 611 47 Updated Sep 13, 2023

FcaNet: Frequency Channel Attention Networks

Python 482 100 Updated Mar 11, 2021

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 945 97 Updated Apr 19, 2024

A curated reading list of research in Mixture-of-Experts(MoE).

524 40 Updated Sep 4, 2023

Copycat Clipboard is an intuitive clipboard manager designed to enhance your workflow. Seamlessly switch between documents, apps, and devices while keeping all your copied items organized and acces…

Dart 304 13 Updated Sep 19, 2024
Next