Ji Ha Jang

Research

I'm interested in multimodal, generative, and commonsense AI, as well as low-level computer vision. My work is driven by curiosity about how AI can better understand and interact with the complexities of the world by combining multiple modalities. Highlighted papers are representative works.

HyFL-CLIP: Hyperbolic Fine-Tuning of CLIP for Robust Long-Context Understanding

Ji Ha Jang*, Hayeon Kim*, Chulwon Lee, Junghun James Kim, Se Young Chun

ECCV, 2026

project page / code / paper

We propose HyFL-CLIP (Hyperbolic fine-tuning of CLIP) for enhancing long-context image-text alignment in CLIP. HyFL-CLIP models hierarchical part-to-whole semantics in hyperbolic space by linking long descriptions, short textual components, and images. It achieves robust performance on long-context retrieval, perturbation-robust retrieval, intra-modality retrieval, and short-text retrieval benchmarks.

Uncertainty-guided Compositional Hyperbolic Alignment with Part-to-Whole Semantic Representativeness

Hayeon Kim*, Ji Ha Jang*, Se Young Chun

CVPR, 2026 (Highlight)

project page / code / paper

We propose UNCHA for enhancing hyperbolic VLMs. UNCHA models part-to-whole semantic representativeness with hyperbolic uncertainty, assigning lower uncertainty to more representative parts and higher uncertainty to less representative ones. UNCHA achieves state-of-the-art performance on zero-shot classification, retrieval, and multi-label classification benchmarks.

RoMaP: Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Hayeon Kim*, Ji Ha Jang*, Se Young Chun

ICCV, 2025

project page / code / paper

We propose RoMaP, a novel framework for local 3D Gaussian editing that enables precise and flexible part-level modifications. RoMaP introduces a geometry-aware 3D mask prediction module and a regularized SDS loss to constrain edits to target regions while preserving context.

INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding

Ji Ha Jang*, Hoigi Seo*, Se Young Chun

ECCV, 2024

project page / paper

We present INTRA, a novel framework for affordance grounding that can be trained without egocentric images, grounds different parts of the same object for different interactions, and accepts free-form text input.

PODIA-3D: Domain Adaptation of 3D Generative Model Across Large Domain Gap Using Pose-Preserved Text-to-Image Diffusion

Gwanghyun Kim, Ji Ha Jang, Se Young Chun

ICCV, 2023

project page / paper

We propose PODIA-3D, a novel pipeline that uses pose-preserved text-to-image diffusion-based domain adaptation for 3D generative models.
