- College Station, TX
- https://vztu.github.io
- @_vztu
- in/zhengzhongtu
Highlights
- Pro
Block or Report
Block or report vztu
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
✨✨Latest Advances on Multimodal Large Language Models
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ���️ All You Need for Multi-Modality Collaborative Perception!
[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering
[ECCV2024] CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians
[CVPR 2024] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes
Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
[ICRA 2023] V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
About [CVPR 2024] The official implementation of paper " Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving"
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[CVPR 2024] Official implementation of "Towards Realistic Scene Generation with LiDAR Diffusion Models"
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
An AI agent that beats the classic game "Snake".