Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

M P's picture

M P

kustomkoder

·

AI & ML interests

None yet

Organizations

None yet

kustomkoder 's collections 3

Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Paper • 2405.07653 • Published May 13, 2024

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Paper • 2403.02626 • Published Mar 5, 2024 • 11
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Paper • 2509.21245 • Published Sep 25 • 39
Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 81

IDEA-Bench: How Far are Generative Models from Professional Designing?

Paper • 2412.11767 • Published Dec 16, 2024
GeoRemover: Removing Objects and Their Causal Visual Artifacts

Paper • 2509.18538 • Published Sep 23
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 38
Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Paper • 2405.07653 • Published May 13, 2024

Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Paper • 2405.07653 • Published May 13, 2024

IDEA-Bench: How Far are Generative Models from Professional Designing?

Paper • 2412.11767 • Published Dec 16, 2024
GeoRemover: Removing Objects and Their Causal Visual Artifacts

Paper • 2509.18538 • Published Sep 23
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 38
Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Paper • 2405.07653 • Published May 13, 2024

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Paper • 2403.02626 • Published Mar 5, 2024 • 11
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29
Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Paper • 2509.21245 • Published Sep 25 • 39
Seedream 4.0: Toward Next-generation Multimodal Image Generation

Paper • 2509.20427 • Published Sep 24 • 81

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs