- 🌱 I’m interested in visual foundation models, including open-vocabulary detection, interactive segmentation, visual LLMs, and image/video generation.
- 🏫 I am a fourth-year Ph.D. student in the Department of Computer Science and Engineering (CSE) at The Hong Kong University of Science and Technology (HKUST).
- 🔭 I am a computer vision intern at the International Digital Economy Academy (IDEA).
- 📫 How to reach me: hzhangcx@connect.ust.hk / zhanghaovs120@gmail.com
- NVIDIA
- Santa Clara
- https://haozhang534.github.io/
Pinned
- IDEA-Research/DINO: [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
- IDEA-Research/detrex: detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
- IDEA-Research/MaskDINO: [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
- IDEA-Research/DN-DETR: [CVPR 2022 Oral] Official implementation of DN-DETR
- IDEA-Research/Grounded-Segment-Anything: Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
- UX-Decoder/Segment-Everything-Everywhere-All-At-Once: [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"