Skip to content
View vztu's full-sized avatar
🦝
Feeding Raccoons
🦝
Feeding Raccoons

Highlights

  • Pro
Block or Report

Block or report vztu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

Python 373 11 Updated Aug 2, 2024

Grounding Image Matching in 3D with MASt3R

Python 608 21 Updated Jul 30, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,987 724 Updated Aug 2, 2024

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Python 383 13 Updated Feb 27, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,618 101 Updated Jul 29, 2024

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 284 15 Updated Aug 1, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 7,766 409 Updated Aug 2, 2024

[ICLR2024] HEAL: An Extensible Framework for Open Heterogeneous Collaborative Perception ���️ All You Need for Multi-Modality Collaborative Perception!

Python 134 7 Updated Jul 23, 2024

[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering

HTML 744 46 Updated Jul 26, 2024

[ECCV2024] CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

155 5 Updated Jul 18, 2024

[CVPR 2024] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

211 4 Updated Jul 25, 2024

Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"

Python 47 2 Updated Jul 19, 2024

[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference

Python 25 1 Updated Jul 18, 2024

[ICRA 2023] V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception

Python 18 3 Updated Oct 17, 2023

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,154 101 Updated May 13, 2024

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Jupyter Notebook 254 14 Updated Jun 27, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 509 25 Updated Jul 25, 2024

About [CVPR 2024] The official implementation of paper " Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving"

Python 27 Updated Jul 19, 2024

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 518 17 Updated Jul 10, 2024

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

Python 382 32 Updated Jul 31, 2024

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

100 1 Updated Jun 18, 2024

🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS 2024 workshop @ CVPR 2024

Python 34 1 Updated Jul 18, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,102 39 Updated Jul 14, 2024

[CVPR 2024] Official implementation of "Towards Realistic Scene Generation with LiDAR Diffusion Models"

Python 147 9 Updated May 21, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,898 295 Updated Jul 16, 2024

An AI agent that beats the classic game "Snake".

Python 1,563 350 Updated Apr 30, 2024

Awesome Papers related to Mamba.

1,013 51 Updated Jul 19, 2024
Next