Publications and research outputs

Vision Language Model, Multimodal

Selected papers across vision-language models, multimodal learning, medical AI, robust perception, forecasting, and explainable systems.

10 Papers and preprints

7 First or co-first author

5 Research themes

2026

3 papers

ECCV 2026 Co-first author

Blind to Position, Biased in Language: Probing Mid-Layer Representational Bias in Vision-Language Encoders for Zero-Shot Language-Grounded Spatial Understanding

Na Min An*, Inha Kang*, Minhyun Lee, Hyunjung Shim

Probes mid-layer vision-language representations to uncover positional blindness and language-dependent bias, improving zero-shot language-grounded spatial understanding.

Paper

ICLR 2026 First author

What “Not” to Detect: Negation-Aware VLMs via Structured Reasoning and Token Merging

Inha Kang, Y Lim, S Lee, J Choi, J Choe, H Shim

Identifies the affirmative bias of VLMs when processing negation and improves detection accuracy with a token-merging module and a reasoning-aware data pipeline.

Paper

CVPR 2026 First author

Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization

CVPR 2026 Compute Transparency Champion

Inha Kang, E Kim, W Ryu, J Shin, S Yu, YH Kang, S Jeong, E Kim, S Kim, H Shim

Applies Group-Relative Policy Optimization (GRPO) with asymmetric rewards to reduce false alarms and deliver reliable five-day air quality forecasts for East Asia.

Paper

2025

2 papers

CVPR 2025 Robust perception

No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather

J Park, H Lee, Inha Kang, H Shim

Addresses safety-critical failures in weather-degraded LiDAR by emphasizing object geometry through physics-inspired augmentation.

Paper

EMNLP Findings 2025 Multimodal

3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation

S Lee*, J Choi*, Inha Kang, J Kim, J Park, H Shim

Transfers structural knowledge from 3D models into 2D VLMs, improving zero-shot 3D classification without requiring large 3D datasets.

Paper

2023

2 papers

CVPR 2023 Meta-research

Why is the winner the best?

M Eisenmann, A Reinke, V Weru, …, Inha Kang, et al.

Studies the reliability of ranking systems in biomedical AI challenges and shows how unstable metrics can misrepresent performance differences.

Paper

J. Hazardous Materials 2023 Co-first author

Three-Dimensional Label-Free Visualization of the Interactions of PM2.5 with Macrophages and Epithelial Cells Using Optical Diffraction Tomography

WS Lee*, Inha Kang*, SJ Yoon, et al.

Uses optical diffraction tomography for 3D, label-free visualization of PM2.5 (fine dust) uptake by macrophages and epithelial cells, enabling quantitative analysis without phototoxicity.

Paper

2022

2 papers

MICCAI 2022 Challenge 1st place

Joint Embedding of 2D and 3D Networks for Medical Image Anomaly Detection

Inha Kang, J Park

Combines 2D texture cues and 3D volumetric context to detect subtle anomalies in brain MRI and abdominal CT, winning the MICCAI MOOD Challenge.

Paper

KTCP 2022 First author

End-to-End Vertebra CT Image Segmentation Network with the 3D Surface-Enhanced Module and the Trainable Preprocessing Method

Inha Kang, JH Cho, J Park

Integrates trainable preprocessing and 3D surface enhancement into an end-to-end segmentation network for vertebra CT analysis.

Paper

2021

1 paper

MICCAI 2021 Co-first author

Self-Supervised 3D Out-of-Distribution Detection via Pseudoanomaly Generation

JH Cho*, Inha Kang*, J Park

Introduces pseudoanomaly generation so 3D anomaly detectors can learn from normal data only, winning the MICCAI MOOD Challenge.

Paper
