- Zhejiang University
- Hangzhou
- https://cslwt.github.io
Starred repositories
Official pytorch implementation of "XHand: Real-time Expressive Hand Avatar"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
When do we not need larger vision models?
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Oriented object detection on the STAR dataset.
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
ms-swift: Use PEFT or full-parameter training to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Efficient Multimodal Large Language Models: A Survey
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A native PyTorch Library for large model training
Open weights LLM from Google DeepMind.
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)