Multi-proposal collaboration and multi-task training for weakly-supervised video moment retrieval
Bolin Zhang,
Chao Yang,
Bin Jiang,
Takahiro Komamizu,
Ichiro Ide
International Journal of Machine Learning and Cybernetics, Vol.16, 7-8, pp.4509-4524, August 2025.
MultiSensor-Home: Benchmark for Multi-modal Multi-view Action Recognition in Home Environments
Trung Thanh Nguyen,
Yasutomo Kawanishi,
Vijay John,
Takahiro Komamizu,
Ichiro Ide
Unknown Journal, IS3-038, August 2025.
Visual Adapter for Extracting Textually-related Features for Video Captioning
Junan Chen,
Trung Thanh Nguyen,
Takahiro Komamizu,
Ichiro Ide
Unknown Journal, IS3-148, August 2025.
MLLM-based Dataset Construction for Hazard-aware Guidance for the Visually Impaired
Peiyuan ZHU,
Marc A. Kastner,
Hirotaka Kato,
Takatsugu Hirayama,
Takahiro Komamizu,
Ichiro Ide
Unknown Journal, IS2-140, July 2025.
Analysis and prediction of attractive fonts on title-overlaid food images
Nanami Takagi,
Haruya Kyutoku,
Keisuke Doman,
Takahiro Komamizu,
Ichiro Ide
Proceedings of the 19th Int. Conf. on Machine Vision Applications (MVA2025), July 2025.
Action Selection Learning for Weakly Labeled Multi-modal Multi-view Action Recognition
Trung Thanh Nguyen,
Yasutomo Kawanishi,
Vijay John,
Takahiro Komamizu,
Ichiro Ide
ACM Transactions on Multimedia Computing, Communications, and Applications, June 2025.
MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion
Trung Thanh Nguyen,
Yasutomo Kawanishi,
Vijay John,
Takahiro Komamizu,
Ichiro Ide
The 19th IEEE International Conference on Automatic Face and Gesture Recognition, pp.1-10, May 2025.