Shu-Jian Huang Ph.D.

Associate Professor, Ph. D. Advisor

Natural Language Processing Research Group

State Key Laboratory of Novel Software Technology

Department of Computer Science and Technology, Nanjing University


[Correspondence] [Biography] [Recruiting] [Research Interests] [Academic Services] [Honors and Awards] [Courses] [Presentations] [Selected Publication]


Correspondence

Office:

Room 902, Computer Science and Technology Building, Xianlin Campus of Nanjing University

163 Xianlin Avenue, Nanjing 210023, China

E-mail:

huangsj at nju dot edu dot cn


Biography

Currently, I am an associate professor and Ph. D. Advisor in Department of Computer Science and Technology of Nanjing University, member of the National Key Laboratory of Novel Software Technology.

I received my B.Sc. degree and Ph. D. in Computer Science in Jun. 2006 and Jun. 2012 from Nanjing University, respectively. I am a member of NLP Group since undergraduate in Sep. 2005, led by Prof. Jiajun Chen. During my Ph. D. study, I spent 11 months (from Oct. 2007 to Aug. 2008) as visiting student in NLC group, MSRA, where I worked with Long Jiang in Chinese Couplet project and in SMT team with Mu Li, Henry Li and Dongdong Zhang. I also spent 12 months as a visiting student in InterACT lab, LTI, CMU, working with Prof. Stephan Vogel.

I was awarded the Excellent Young Scholar Research Project by Jiangsu Provincial Research Foundation in 2017, CCF-NLPCC Young Outstanding Scientist Award in 2020, CIPSC Hanwang Youth Innovation Award in 2022. Our alumni Hao Zhou, Zaixiang Zheng, Yu Bao (co-advised with Prof. Jiajun Chen) were awarded Best Ph. D. Thesis Awards.

Our translation quality estimation research and systems won the first place in WMT2022 QE subtasks (word-level MQM (En-De), sentence-level MQM (En-De)) and all 3 tasks in En-De direction in WMT2023 QE (word-level MQM, sentence-level MQM, error span detection).


Recruiting

I am looking for highly motivated undergraduate students to work together on NLP problems. If you have no NLP background, please consider joining our NLP summer camp first (every summer, mainly for freshman or sophomore). There are some talks about our research on bilibili.com. Please take a look before you apply.

We are also expecting post-doc researchers to work together on NLP research or applications.

I am terribly sorry for not being able to reply all emails applying for a Ph.D. position (overwhelmed by the applications). My reply is usually fast. Please consider me as unavailable if no reply within 3~4 working days.


Research Interests

My research is supported by projects from National Natural Science Foundation of China (NSFC), National Key R&D Program of China and the Jiangsu Provincial Research Foundation for Basic Research. We also have wide collaborations with industrial labs in Baidu, Tencent, Alibaba, ByteDance, Huawei, ZTE, etc.

My research interests lie in natural language processing (NLP), one of the hottest and most fundamental challenges in artificial intelligence, which is to automatically understand and generate natural language texts. My group and I are working on problems such as machine translation, summarization, question answering, etc. These problems usually require a deep understanding of languages, as well as the ability of fluent generation. Previous attempts to these problems involve lexical/syntactic/semantic analysis of natural language texts. Nowadays, the development of pretrained models and large language models (LLMs) provides other possibilities.

We are particularly interested in designing and applying statistical methods/models (including deep learning models, LLMs) for these problems. Currently, we mainly focus on the following topics:


Academic Services

Services in Academic Society

Services in Conferences and Events


Honors and Awards


Courses


Presentations

(mostly in Chinese, hosted on bilibili.com)


Selected Publications

A More Complete List on Google Scholar

* marks corresponding author(s).

2024

Diffusion Language Models Are Versatile Protein Learners.
Xinyou Wang, Zaixiang Zheng, Fei Ye, Dongyu Xue, Shujian Huang, Quanquan Gu*
arXiv:2402.18567

Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models.
Changjiang Gao, Jixing Li*, Jiajun Chen, Shujian Huang*.
arXiv:2403.04325 (code)

Question Translation Training for Better Multilingual Reasoning.
Wenhao Zhu, Shujian Huang*, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch.
arXiv:2401.07817 (code)

MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization.
Shuaijie She, Wei Zou, Shujian Huang*, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen.
arXiv:2401.06838 (code)

Multi-Candidate Speculative Decoding.
Sen Yang, Shujian Huang*, Xinyu Dai, Jiajun Chen.
arXiv:2401.06706 (code)

Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation.
Xu Huang, Zhirui Zhang*, Xiang Geng, Yichao Du, Jiajun Chen, Shujian Huang*.
arXiv:2401.06568

Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly.
Changjiang Gao, Hongda Hu, Peng Hu, Jiajun Chen, Jixing Li, Shujian Huang*.
Accpeted by NAACL 2024 (arXiv:2403.04325)

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation.
Jiahuan Li, Shanbo Cheng, Shujian Huang*, Jiajun Chen.
Accpeted by NAACL 2024 (arXiv:2403.09522)

A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily.
Peng Ding, Jun Kuang, Dan Ma, Xuezhi Cao, Yunsen Xian, Jiajun Chen, Shujian Huang*.
Accpeted by NAACL 2024 (arXiv:2311.08268) (code)

Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models.
Shuaijie She, Shujian Huang*, Xingyun Wang, Yanke Zhou, Jiajun Chen.
Accpeted by NAACL 2024 (arXiv:2311.07194)

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis.
Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang*, Lingpeng Kong, Jiajun Chen, Lei Li.
Accepted by Findings of NAACL 2024 (arXiv:2304.04675) (video, code)

kNN-BOX: A Unified Framework for Nearest Neighbor Generation.
Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang*, Siheng Zhao, Sizhe Liu, Jiajun Chen.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics System Demonstrations (EACL2024), pages 10–17. (arXiv:2302.13574) (video, code)

2023

Dictionary Definition Augemented Neural Machine Translation for Anciet Chinese Text.
Jiahuan Li, Ruochun Wu, Wenjing Hu, Jixuan Chen, Weilu Xu, Shujian Huang*, Jiajun Chen.
Accepted by CCMT2023 (in Chinese). Best Paper Award.

IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems.
Xu Huang, Zhirui Zhang, Ruize Gao, Yichao Du, Lemao Liu, Guoping Huang, Shuming Shi, Jiajun Chen, Shujian Huang*.
In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 14903-14917 December 6-10, 2023.

Improved Pseudo Data for Machine Translation Quality Estimation with Constrained Beam Search.
Xiang Geng, Yu Zhang, Zhejian Lai, Shuaijie She, Wei Zou, Shimin Tao, Hao Yang, Jiajun Chen, Shujian Huang*.
In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 12434-12447 December 6-10, 2023.

Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention.
Changjiang Gao, Shujian Huang*, Jixing Li, Jiajun Chen.
In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 13042-13055 December 6-10, 2023.

Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation.
Zihan Liu, Zewei Sun, Shanbo Cheng, Shujian Huang, Mingxuan Wang.
In Proceedings of IJCNLP&AACL 2023, pages 733-743, November 1-4, 2023.

Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models.
Zheng Ma, Mianzhi Pan, Wenhan Wu, Kanzhi Cheng, Jianbing Zhang*, Shujian Huang*, and Jiajun Chen.
In Proceedings of ACMMM’2023, pages 5674-5681, October 29-November 3, 2023, Ottawa, ON, Canada.

Extrapolating Large Language Models to Non-English by Aligning Languages.
Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, Shujian Huang*, Lingpeng Kong, Jiajun Chen, Lei Li.
arXiv:2308.04948 (video, code)

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions.
Jiahuan Li, Hao Zhou, Shujian Huang*, Shanbo Cheng, Jiajun Chen.
Accepted by TACL (arXiv:2305.15083) (video, code)

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation.
Wenhao Zhu, Shujian Huang*, Yunzhe Lv, Xin Zheng and Jiajun CHEN.
In Findings of the Association for Computational Linguistics: ACL 2023, pages 2824-2836. (code)

INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation.
Wenhao Zhu, Jingjing Xu, Shujian Huang*, Lingpeng Kong and Jiajun CHEN.
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15948-15959. (code)

Local Interpretation of Transformer Based on Linear Decomposition.
Sen Yang, Shujian Huang*, Wei Zou, Jianbing Zhang, Xinyu Dai and Jiajun CHEN.
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 10270-10287. (video, code)

BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training.
Yiming Yan, Tao Wang, Chengqi Zhao, Shujian Huang*, Jiajun CHEN and Mingxuan Wang.
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5428-5443. (video, code)

Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation.
Min Liu, Yu Bao, Chengqi Zhao, Shujian Huang*.
In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), pages 13246-13254. (code)

CoP: Factual Inconsistency Detection by Controlling the Preference.
Shuaijie She, Xiang Geng, Shujian Huang*, Jiajun Chen.
In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), pages 13556-13563. (video, code)

Denoising Pre-Training for Machine Translation Quality Estimation with Curriculum Learning.
Xiang Geng, Yu Zhang, Jiahuan Li, Shujian Huang*, Hao Yang, Shimin Tao, Yimeng Chen, Ning Xie, Jiajun Chen.
In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), pages 12827-12835. (video, code)

2022

Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation.
Jiahuan Li, Shanbo Cheng, Zewei Sun, Mingxuan Wang, Shujian Huang*.
arXiv:2212.08822.

Unsupervised Paraphrasing via Syntactic Template Sampling.
Yu Bao, Shujian Huang*, Hao Zhou, Lei Li, Xinyu Dai, Jiajun Chen.
SCIENTIA SINICA Informationis, Volume 52, pages 1808-1821, 2022 (in Chinese).

Helping the Weak Makes You Strong: Simple Multi-Task Learning Improves Non-Autoregressive Translators,
Xinyou Wang, Zaixiang Zheng*, Shujian Huang*.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 5513-5519, December 7-11, 2022.

FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation.
Wenhao Zhu, Shujian Huang*, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen.
International Conference on Language Resources and Evaluation (LREC2022), pages 6719-6727, Marseille, France, 2022.

BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation.
Yanling Xiao, Lemao Liu*, Guoping Huang, Qu Cui, Shujian Huang*, Shuming Shi, Jiajun Chen.
In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1958-1969, Dublin, Ireland, 2022.

latent-GLAT: Glancing at Latent Variables for Parallel Text Generation.
Yu Bao, Hao Zhou, Shujian Huang*, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei Li.
In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 8398-8409, Dublin, Ireland, 2022.

Non-Parametric Online Learning from Human Feedback for Neural Machine Translation.
Dongqi Wang, Haoran Wei, Zhirui Zhang, Shujian Huang*, Jun Xie, Jiajun Chen.
In Proceedings of The Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22), pages 11431-11439.

2021

Duplex Sequence-to-Sequence Learning for Reversible Machine Translation.
Zaixiang Zheng, Hao Zhou*, Shujian Huang, Jiajun Chen, Jingjing Xu, Lei Li.
In Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia.

Learning Kernel-Smoothed Machine Translation with Retrieved Examples.
Qingnan Jiang, Mingxuan Wang, Jun Cao, Shanbo Cheng, Shujian Huang*, Lei Li.
In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 7280-7290, Online and Punta Cana, Dominican Republic.

Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation.
Xin Zheng, Zhirui Zhang, Shujian Huang*, Boxing Chen, Jun Xie, Weihua Luo, Jiajun Chen.
In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 4234-4241 (short paper).

Adaptive Nearest Neighbor Machine Translation.
Xin Zheng, Zhirui Zhang, Junliang Guo, Shujian Huang*, Boxing Chen, Weihua Luo, Jiajun Chen.
In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL2021 Short Papers), pages 368-374, August 1-6, 2021.

When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation.
Jiahuan Li, Yutong Shen, Shujian Huang*, Xin-Yu Dai, Jiajun Chen.
In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL2021 Short Papers), pages 543-549, August 1-6, 2021.

Non-Autoregressive Translation by Learning Target Categorical Codes.
Yu Bao, Shujian Huang*, Tong Xiao, Dongqi Wang, Xin-Yu Dai, Jiajun Chen.
In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL2021), pages 5749-5759, June 6-11, 2021.

DirectQE: Direct Pretraining for Machine Translation Quality Estimation.
Qu Cui, Shujian Huang*, Jiahuan Li, Xiang Geng, Zaixiang Zheng, Guoping Huang, Jiajun Chen.
In Proceedings of Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI2021), pages 12719-12727.

2020

Toward Making the Most of Context in Neural Machine Translation.
Zaixiang Zheng, Xiang Yue, Shujian Huang*, Jiajun Chen, Alexandra Birch.
In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI2020), Yokohama, Japan, 2021.

Improving Self-Attention Networks with Sequential Relations.
Zaixiang Zheng, Shujian Huang*, Rongxiang Weng, Xinyu Dai, Jiajun Chen.
IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 28, pages 1707-1716, 2020.

Mirror-Generative Neural Machine Translation.
Zaixiang Zheng, Hao Zhou, Shujian Huang*, Lei Li, Xin-Yu Dai, Jiajun Chen.
International Conference on Learning Representations (ICLR2020), pages 1-16, Addis Ababa, Ethiopia, 2020. (with Highest Ratings from all reviewers, Oral Presentation(selected))

A Reinforced Generation of Adversarial Examples for Neural Machine Translation.
Wei Zou, Shujian Huang*, Jun Xie, Xinyu Dai, Jiajun Chen.
In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 3486-3497, July 5-10, 2020.

Explicit Semantic Decomposition for Definition Generation.
Jiahuan Li, Yu Bao, Shujian Huang*, Xinyu Dai, Jiajun Chen.
In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 708-717, 2020.

RPD: A Distance Function Between Word Embeddings.
Xuhui Zhou, Zaixiang Zheng, Shujian Huang.
In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics Student Research Workshop, pages 42-50, 2020.

GRET: Global Representation Enhanced Transformer.
Rongxiang Weng, Shujian Huang*, Hao-Ran Wei, Heng Yu, Weihua Luo, Lidong Bing, Jiajun Chen.
The Thirty-Fourth AAAI Conference on Artificial Intelligence, New York, pages 9258-9265, 2020.

Generating Diverse Translation by Manipulating Multi-Head Attention.
Zewei Sun, Shujian Huang*, Hao-Ran Wei, Xin-yu Dai, Jiajun Chen.
The Thirty-Fourth AAAI Conference on Artificial Intelligence, pages 8976-8983, New York, 2020.

Acquiring Knowledge from Pre-trained Model to Neural Machine Translation.
Rongxiang Weng, Heng Yu, Shujian Huang*, Shanbo Cheng, Weihua Luo.
The Thirty-Fourth AAAI Conference on Artificial Intelligence, pages 9266-9273, New York, 2020.

2019

Improving Bilingual Lexicon Induction on Distant Language Pairs.
Wenhao Zhu, Zhihao Zhou, Shujian Huang*, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen.
In Proceedings of China Conference on Machine Translation (CCMT2019), pages 1-10, Nanchang, China, 2019. Best English Paper Award

Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation.
Huiyun Yang, Shujian Huang*, Xinyu Dai, Jiajun Chen.
In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pages 4188-4197, Hong Kong, China, November 3-7, 2019.

Dynamic Past and Future for Neural Machine Translation.
Zaixiang Zheng, Shujian Huang*, Zhaopeng Tu, Xin-Yu Dai, Jiajun Chen.
In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pages 930-940, Hong Kong, China, November 3-7, 2019.

Learning Representation Mapping for Relation Detection in Knowledge Base Question Answering.
Peng Wu, Shujian Huang*, Rongxiang Weng, Zaixiang Zheng, Jianbing Zhang, Xiaohui Yan and Jiajun Chen.
In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 6130-6139 Florence, Italy, July 28-August 2, 2019.

Generating Sentences from Disentangled Syntactic and Semantic Spaces.
Yu Bao, Hao Zhou, Shujian Huang*, Lei Li, Lili Mou, Olga Vechtomova, XIN-YU DAI and Jiajun CHEN.
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 6008-6019 Florence, Italy, July 28-August 2, 2019.

Utilizing Non-Parallel Text for Style Transfer by Making Partial Comparisons.
Di Yin, Shujian Huang*, Xin-Yu Dai and Jiajun Chen.
In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), pages 5379-5386, Macao, China, 2019.

Correct-and-Memorize: Learning to Translate from Interactive Revisions.
Rongxiang Weng, Hao Zhou, Shujian Huang*, Lei Li, Yifan Xia and Jiajun Chen.
In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), pages 5255-5263, Macao, China, 2019.

Online Distilling from Checkpoints for Neural Machine Translation.
Hao-Ran Wei, Shujian Huang*, Boxing Chen, Ran Wang, XIN-YU DAI and Jiajun CHEN.
In Proceedings of NAACL-HLT 2019, pages 1932-1941, Minneapolis, Minnesota, June 2-June 7, 2019.

2018

Modeling Past and Future for Neural Machine Translation.
Zaixiang Zheng, Hao Zhou, Shujian Huang*, Lili Mou, Xinyu Dai, Jiajun Chen, and Zhaopeng Tu.
Transactions of the Association for Computational Linguistics (TACL), Volume 6, pages 145-157, Melbourne, Australia, 2018.

Combining character and word information in neural machine translationusing a multi-level attention.
Huadong Chen, Shujian Huang*, David Chiang, Xinyu Dai, and Jiajun Chen.
In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1284-1293, New Orleans, Louisiana, 2018.

Learning to Discriminate Noises for Incorporating External Information in Neural Machine Translation.
Zaixiang Zheng, Shujian Huang*, Zewei Sun, Rongxiang Weng, Xin-Yu Dai, Jiajun Chen.
arXiv:1810.10317, 2018.

Controlling the Transition of Hidden States for Neural Machine Translation.
Zaixiang Zheng, Shujian Huang*, Xin-Yu Dai, Jiajun Chen.
In China Workshop on Machine Translation, pages 86-92. Springer, Singapore, 2018.

2017

Rgraph:Generating reference graphs for better machine translation evaluation.
Hongjie Ji, Shujian Huang*, Qi Hou, Cunyan Yin, and Jiajun Chen.
In China Workshop on Machine Translation, pages 55-67, Springer, Dalian, China, 2017.

Compressing neural networks byapplying frequent item-set mining.
Zi-Yi Dou, Shu-Jian Huang*, and Yi-Fan Su.
In International Conference on Artificial Neural Networks, pages 696-704, Springer, Alghero, Sardinia, Italy, 2017.

Neural Machine Translation with Word Predictions.
Rongxiang Weng, Shujian Huang*, Zaixiang Zheng, Xinyu Dai and Jiajun Chen.
Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 136-145, Copenhagen, Denmark, 2017.

Top-rank Enhanced Listwise Optimization for Statistical Machine Translation.
Huadong Chen, Shujian Huang*, David Chiang, XIN-YU DAI and Jiajun CHEN.
Conference on Computational Natural Language Learning (CoNLL), pages 90-99, Vancouver, Canada, 2017.

AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles.
Jianbing Zhang, Yixin Sun, Shujian Huang*, Cam-Tu Nguyen, Xiaoliang Wang, Xinyu Dai, Jiajun Chen, Yang Yu.
International Joint Conference on Artificial Intelligence (IJCAI), pages 4221-4227, Melbourne, Australia, 2017.

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder.
Huadong Chen, Shujian Huang*, David Chiang, Jiajun Chen.
Annual Meeting of the Association for Computational Linguistics (ACL), pages 580-585, Vancouver Canada, 2017.

Chunk-based Bi-Scale Decoder for Neural Machine Translation.
Hao Zhou, Zhaopeng Tu, Shujian Huang, Xiaohua Liu, Hang Li and Jiajun Chen.
Annual Meeting of the Association for Computational Linguistics (ACL), pages 1936-1945, Vancouver Canada, 2017. (Short paper).

A Neural Probabilistic Structured-Prediction Method for Transition-Based Natural Language Processing.
Hao Zhou, Yue Zhang*, Chuan Chen, Shujian Huang*, Xin-Yu Dai, and Jiajun Chen.
Journal of Artificial Intelligence Research (JAIR), Volume 58, pages 703-729, 2017.

2016

A Search-Based Dynamic Reranking Model for Dependency Parsing.
Hao Zhou, Yue Zhang, Shujian Huang, Junsheng Zhou, XIN-YU DAI and Jiajun Chen.
In Proceedings of 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pages 1393-1402, Berlin, Germany, August 7-12, 2016.

Tree-state based Rule Selection Models for Hierarchical Phrase-based Machine Translation.
Shujian Huang, Huifeng Sun, Chengqi Zhao, Jinsong Su, Xinyu DAI and Jiajun Chen.
In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2016), pages 2817-2823, New York, USA, July 9-15, 2016.

PRIMT: A Pick-Revise Framework for Interactive Machine Translation.
Shanbo Cheng, Shujian Huang*, Huadong Chen, Xinyu DAI and Jiajun Chen.
In Proceedings of NAACL-HLT 2016, pages 1240-1249, San Diego, California, June 12-17, 2016.

Evaluating a Deterministic Shift-Reduce Neural Parser for Constituent Parsing.
Hao Zhou, Yue Zhang, Shujian Huang, Xin-Yu Dai, and Jiajun Chen.
In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pages 659-663, Slovenia, Portoroz, May 23-28, 2016.

Adaptation of Language Models for SMT Using Neural Networks with Topic Information.
Yinggong Zhao, Shujian Huang*, Xinyu Dai, and Jiajun Chen.
ACM Transactions on Asian and Low-Resource Language Information Processing (ACM TALLIP), 2016, 15(3): 19:1-19:15.

Enhancing Shift-Reduce Constituent Parsing with Action N-Gram Model.
Hao Zhou, Shujian Huang*, Junsheng Zhou, Yue Zhang, Huadong Chen, Xinyu Dai, Chuan Cheng, Jiajun Chen.
ACM Transactions on Asian and Low-Resource Language Information Processing (ACM TALLIP), 2016, 15(3): 13:1-13:17.

2015

Resolving Coordinate Structures for Chinese Constituent Parsing.
Yichu Zhou, Shujian Huang*, Xinyu Dai, Jiajun Chen.
in Natural Language Processing and Chinese Computing, J. Li et al. (Eds.): NLPCC 2015, LNAI 9362, pp. 353-361, Springer International Publishing, 2015.

A Unified Framework for Jointly Learning Distributed Representations of Word and Attributes.
Liqiang Niu, Xin-Yu Dai, Shujian Huang, and Jiajun Chen.
In Proceedings of 7th Asian Conference on Machine Learning (ACML 2015) November 20-22, 2015, Hong Kong, JMLR: Workshop and Conference Proceedings 45:143-156, 2015

Graph-Based Collective Lexical Selection for Statistical Machine Translation.
Jinsong Su, Deyi Xiong, Shujian Huang, Xianpei Han, Junfeng Yao.
In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP2015), pages 1238-1247, Lisbon, Portugal, 17-21 September 2015.

Non-linear Learning for Statistical Machine Translation.
Shujian Huang, Huadong Chen, Xinyu Dai, Jiajun Chen.
In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL2015), pages 825-835, Beijing, China, July 26-31, 2015.

A Neural Probabilistic Structured-Prediction Model for Transition-Based Dependency Parsing.
Hao Zhou, Yue Zhang, Shujian Huang, Jiajun Chen.
In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (ACL2015), pages 1213-1222, Beijing, China, July 26-31, 2015.

A Synthetic Approach for Recommendation: Combining Ratings, Social Relations, and Reviews.
G. Hu, X. Dai, S. Huang, J. Chen.
In Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015), pages 1756-1762.

Structured sparsity with group-graph regularization.
X. Dai, J. Zhang, S. Huang, J. Chen, and Z. Zhou.
In: Proceedings of the 29th AAAI Conference on Artificial Intelligence (AAAI’15), Austin, TX, 2015

2014

Learning Word Embeddings from Dependency Relations.
Yinggong Zhao, Shujian Huang, Xinyu Dai, Jianbing Zhang, Jiajun Chen.
International Conference on Asian Language Processing 2014 (IALP 2014), October 20-23, 2014, Sarawak, Malaysia.

An Investigation on Statistical Machine Translation with Neural Language Models.
Yinggong Zhao, Shujian Huang, Huadong Chen, and Jiajun Chen.
CCL and NLP-NABD 2014, pp. 175-186, October 18-19, 2014, Wuhan, China.

2013

Hypothesis Pruning in Learning Word Alignment.
HUANG Shujian, DAI Xinyu, CHEN Jiajun.
Chinese Journal of Electronics, 2013 Vol. 22 (CJE-1).

2012

Enhancing Statistical Machine Translation with Character Alignment.
Ning Xi, Guangchao Tang, Xinyu Dai, Shujian Huang, Jiajun Chen.
In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (short paper), Jeju Island, Korea, July 8 - 14, 2012.

2011

Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment.
Shujian Huang, Stephan Vogel and Jiajun Chen.
In ACL:HLT 2011:shortpaper, Portland, Oregon, USA, June 19-24, 2011.

A Syntax-based Pre-reordering Method for Chinese-English Machine Translation.
Qiufeng Wu, Shujian Huang, Xinyu Dai and Jiajun Chen.
In Proceedings of the 12th Chinese National Conference on Computational Linguistics (CNCCL-2011), Luoyang, China, June 2010. (In Chinese)

2010

Improving Word Alignment by Semi-supervised Ensemble.
Shujian Huang, Kangxi Li, Xinyu Dai and Jiajun Chen.
In CoNLL 2010. Uppsala, Sweden, July 11-16, 2010.

2009

Combining ILP and MLN for Coreference Resolution.
Yabing Zhang, Junsheng Zhou, Shujian Huang and Jiajun Chen.
International Conference on Asian Language Processing (IALP 2009), Singapore, Dec 7-9, 2009

Segmenting Long Sentence Pairs for Statistical Machine Translation.
Biping MENG, Shujian Huang, Xinyu Dai and Jiajun Chen.
International Conference on Asian Language Processing (IALP 2009), Singapore, Dec 7-9, 2009

Global Optimization Based On Clustering for Coreference Resolution.
Liu Weipeng, Zhou Junsheng, Huang Shujian and Chen Jiajun.
The 10th Chinese National Conference on Computational Linguistics (CNCCL-2009), Yantai, China, July 24-26, 2009. (In Chinese)

An Error-Sensitive Metric for Word Alignment in Phrase-based SMT.
Shujian Huang, Ning Xi, Yinggong Zhao, Xinyu Dai, Jiajun Chen.
Journal of Chinese Information Processing, 2009, vol. 23, no. 3. (Revised version of CWMT2008 paper, In Chinese)

Coreference Resolution using Markov Logic Networks.
Shujian Huang, Yabing Zhang, Junsheng Zhou, Jiajun Chen.
The 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing’2009), Mexico city, Mexico, 2009 poster; in Research in Computing Science: Advances in Computational Linguistics, Alexander Gelbukh Ed., vol. 41, page 157~168, ISSN: 1870-4069. Best Poster Award (1/25)

2007

A New Graph Clustering Algorithm for Chinese Noun Phrase Coreference Resolution.
Junsheng Zhou, Shujian Huang, Jiajun Chen and Weiguang Qu.
Journal of Chinese Information Processing, 2007, vol. 21, no. 2. (In Chinese)

back to the top


Last Update 2024-01-17

说明: 说明: 说明: 说明: 说明: 说明: 说明: Locations of visitors to this page