Tsu-Jui Fu
Cited by
Cited by
GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction
TJ Fu, PH Li, WY Ma
ACL (Long), 2019
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv:2111.12681, 2021
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
W Feng, X He, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, XE Wang, ...
ICLR, 2023
Dynamic Video Segmentation Network
YS Xu, TJ Fu*, HK Yang*, CY Lee
CVPR, 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
ZW Hong, TY Shann, SY Su, YH Chang, TJ Fu, CY Lee
NeurIPS, 2018
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
TJ Fu, X Wang, M Peterson, S Grafton, M Eckstein, WY Wang
ECCV (Spotlight), 2020
Attentive and Adversarial Learning for Video Summarization
TJ Fu, SH Tai, HT Chen
WACV (Oral), 2019
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
W Feng*, W Zhu*, T Fu, V Jampani, A Akula, X He, S Basu, XE Wang, ...
NeurIPS, 2023
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER
PH Li, TJ Fu, WY Ma
AAAI (Oral), 2020
Language-Driven Artistic Style Transfer
TJ Fu, XE Wang, WY Wang
ECCV, 2022
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
TJ Fu*, L Li*, Z Gan, K Lin, WY Wang, L Wang, Z Liu
CVPR, 2023
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
TJ Fu, WY Wang, D McDuff, Y Song
AAAI, 2022
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
TJ Fu, X Wang, S Grafton, M Eckstein, WY Wang
EMNLP (Oral), 2020
Guiding Instruction-based Image Editing via Multimodal Large Language Models
TJ Fu, W Hu, X Du, WY Wang, Y Yang, Z Gan
ICLR (Spotlight), 2024
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
TJ Fu, L Yu, N Zhang, CY Fu, JC Su, WY Wang, S Bell
CVPR, 2023
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
W Zhu, X Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
EACL (Long), 2021
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
R Schumann, W Zhu, W Feng, TJ Fu, S Riezler, WY Wang
AAAI, 2024
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
TJ Fu, XE Wang, ST Grafton, MP Eckstein, WY Wang
CVPR, 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
X He, D Yang, W Feng, TJ Fu, A Akula, V Jampani, P Narayana, S Basu, ...
EMNLP (Long), 2022
Speed Reading: Learning to Read ForBackward via Shuttle
TJ Fu, WY Ma
EMNLP (Long), 2018
The system can't perform the operation now. Try again later.
Articles 1–20