100
自然语言处理中的多任务学习 NLPCC 2018 Tutorials 邱锡鹏 复旦大学 2018年9月12日 http://nlp.fudan.edu.cn/xpqiu

Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习

NLPCC 2018 Tutorials

邱锡鹏

复旦大学

2018年9月12日

http://nlp.fudan.edu.cn/xpqiu

Page 2: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 2

研究组介绍

主要聚焦于深度学习与自然语言处理领域,包括语言表示学习、词法/句法分析、文本推理、问答系统等方面。

主要成果

近几年发表国际顶级会议/期刊(IJCAI、ACL、AAAI、EMNLP等)论文40余篇,ACL2017杰出论文

开源自然语言处理系统:FudanNLP

Currently, an incubating project: fastNLP

Page 3: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 3

报告概要

自然语言处理简介

基于深度学习的自然语言处理

深度学习在自然语言处理中的困境 无监督预训练

多任务学习

自然语言处理中的多任务学习 硬共享模式

软共享模式

共享-私有模式

函数共享模式

多级共享模式

主辅任务模式

新的多任务基准平台

Page 4: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 4

自然语言处理简介

Page 5: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 5

从人工智能开始

Alan Turing (1912-1954)

自然语言处理:理解和生成

图片来源:https://zhuanlan.zhihu.com/p/31843033

Page 6: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 6

什么是自然语言处理(NLP)?

自然语言≈人类语言

区别于人工语言(比如程序语言)

自然语言处理包括语音识别、自然语言理解、自然语言生成、人机交互以及所涉及的中间阶段。

是人工智能和计算机科学的子学科。

基础技术

• 词法分析

• 句法分析

• 实体识别

• 语义分析

• 篇章分析

• 语言模型

核心技术

• 机器翻译

• 自动问答

• 情感分析

• 信息抽取

• 文本摘要

• 文本蕴涵

应用

• 智能客服

• 搜索引擎

• 个人助理

• 推荐系统

• 舆情分析

• 知识图谱

Page 7: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 7

自然语言处理的难点:歧义性

以中文分词为例

不同的语言环境中的同形异构现象,按照具体语言环境的语义进行切法。

交叉歧义

他/说/的/确实/在理

组合歧义

两个/人/一起/过去、个人/问题

从马/上/下来、马上/就/来

句子级歧义

白天鹅在水里游泳

该研究所获得的成果

伪歧义

Page 8: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 8

发展历程

1990年以前

基于规则的方法

1990年~2012年

基于统计学习的方法

• 离散表示

• 线性模型

2012年~

基于深度学习的方法

• 神经网络

• 分布式表示

• 非线性模型

理解生成

交互

数据驱动

规则驱动

Page 9: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 9

理想中的自然语言处理流程

这是一棵语法树

这 是 一 棵 语法 树

这 是 一 棵 语法 树代词 动词 数词 量词 名词 动词

这 是 一 棵 语法 树代词 动词 数词 量词 名词 动词

分词

词性标注

句法分析

语义分析

这, 是,语法树

应用

语义分析机器翻译自动问答情感分析

… …

知识库

Page 10: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 10

NLP技术路线

Page 11: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 11

基于深度学习的自然语言处理

Page 12: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 12

语言表示

如何在计算机中表示语言的语义?

知识库规则

分布式表示

• 压缩、低维、稠密向量• 用O(N)个参数表示 O(2k)区间

• k为非0参数,k<N

Page 13: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 13

一个生活中的例子:颜色

命名 RGB值

红 [1,0,0]

绿 [0,1,0]

蓝 [0,0,1]

中国红 [0.67, 0.22, 0.12]

咖啡色 [0.64, 0.16,0.16]

Page 14: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 14

词嵌入(Word Embeddings)

https://indico.io/blog/visualizing-with-t-sne/

上海

北京

高兴

难过

Page 15: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 15

分布式表示--来自神经科学的证据

http://www.nature.com/nature/journal/v532/n7600/full/nature17637.html

Page 16: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 16

词嵌入

Socher et al. (2013)

From Mikolov et al. (2013)

W("woman")−W("man") ≃ W("aunt")−W("uncle")

W("woman")−W("man") ≃ W("queen")−W("king")

W("中国")−W("北京") ≃ W("英国")−W("伦敦")

Page 17: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 17

语言表示学习

短语 组合语义模型

句子 连续词袋模型

序列模型

递归组合模型

卷积模型

篇章 层次模型

北京的天气真不错。

Page 18: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 18

为什么语言表示学习更难?

计算机视觉中的深层网络模型

对应NLP的最底层:词汇

152 层

22层

Page 19: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

CCL 2016 中科院张钹院士《后深度学习时代的计算语言学》

Page 20: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 20

自然语言处理的新范式

在得到字、句子表示之后,自然语言处理任务类型划分为

类别(对象)

到序列

文本生成

图像描述

生成

序列到类别

文本分类

情感分析

同步的序列

到序列

中文分词

词性标注

语义角色

标注

异步的序列

到序列

机器翻译

自动摘要

对话系统

减轻了对特征工程的依赖!

Page 21: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 21

序列到类别

输入:序列

输出:类别Sentiment

Analysis

……

我 觉 太得 好 了

超正面

正面

中性

负面

超负面

带着愉悦的心情看了这部电影

这部电影太糟了 这部电影很棒

Positive (正面) Negative (负面) Positive (正面)

……

来源:李宏毅《1天搞懂深度学习》

Page 22: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 22

同步的序列到序列模式

序列标注

Page 23: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 23

语音识别

Connectionist Temporal Classification (CTC) [Alex Graves,

ICML’06][Alex Graves, ICML’14][Haşim Sak, Interspeech’15][Jie Li, Interspeech’15][Andrew Senior, ASRU’15]

好 φ φ 棒 φ φ φ φ 好 φ φ 棒 φ 棒 φ φ

“好棒” “好棒棒”

来源:李宏毅《1天搞懂深度学习》

Page 24: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 24

信息抽取

Page 25: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 25le

arn

ing

异步的序列到序列模式

机器翻译

ma

ch

ine

机 习器 学 。

来源:李宏毅《1天搞懂深度学习》

Page 26: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 26

看图说话

Page 27: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 27

One Size fits ALL!

Page 28: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 28

One Size fits ALL!

Page 29: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 29

深度学习在自然语言处理中的困境

Page 30: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 30

深度学习在自然语言处理中的“困境”

What• Not Truly Deep

• One layer LSTM + Attention is enough for most NLP tasks.

Why• 缺少大规模的标注数据

• 标注代价太高

How• 无监督预训练

• 多任务学习

Page 31: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 31

无监督预训练

Page 32: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 32

无监督预训练

词级别

语言模型

Word2Vec (CBOW and Skip-Gram)

GLOVE

FastText

ELMo: Embeddings from Language Models

句子级别

Skip-Thought

Paragraph Vector

Page 33: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

NAACL 2018 Best Paper

ELMo: Embeddings from Language Models

https://arxiv.org/pdf/1802.05365.pdf

Page 34: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 34

多任务学习

Page 35: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 35

单任务学习

Page 36: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 36

多任务学习 Multitask Learning

Page 37: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

Multitask Learning is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias. It does this by learning tasks in parallel while using a shared representation; what is learned for each task can help other tasks be learned better.

Machine Learning 1997

Page 38: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 38

A NLP Example

Page 39: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

MTL NLPToo Hard

MTL NLP DL Perfect

Page 40: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 40

多任务学习+深度学习

Neural network based approaches make MTL particulary attractive/easy

Page 41: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 41

不同学习范式之间的关系

迁移

学习

多任务

学习

多标签

学习

多类

学习

迁移学习 Transfer Learning

在源领域上学习模型

泛化到目标领域上

多任务学习 Multi-Task Learning

同时建模多个相关任务

不同任务有不同的数据和标签

多标签学习 Multi-Label Learning

一个样本可以有多个标签

建模标签之间的关系

多类学习 Multi-Class Learning

一个样本只能属于一个标签

Page 42: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 42

损失函数

损失函数

Joint Losses?

Page 43: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 43

训练方式

Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task.

2. Select a random training example from this task.

3. Update the parameters for this task by taking a gradient step with respect to this example.

4. Go to 1.

Fine Tuning: After the joint learning phase, we can use a fine tuning strategy to further optimize the performance for each task.

Page 44: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 44

Why does MTL work?

隐式的数据增强

更好的表示学习 一个好的表示需要能够提高多个任务的性能。

正则化 共享参数在一定程度上弱化了网络能力,防止过拟合

Eavesdropping(窃听)

Page 45: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 45

自然语言处理中的多任务学习

Page 46: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 46

思维方式的改变

输入 词法分析 句法分析 任务

输入 共享模块

词法分析

句法分析

任务

Page 47: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 47

自然语言中的多任务学习

• Multi-Domain Text Classification

• Multi-Domain Sentiment Analysis

多领域(Multi-Doman)任务

• part-of-speech (POS)

• tagging, named entity recognition (NER)

• semantic role labeling (SRL)

多级(Multi-Level)任务

• Machine translation

• Multi-lingual parsing

多语言(Multi-Linguistic)任务

• Visual QA

• Image Caption

多模态(Multi-Modality)任务

Page 48: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 48

深度学习+多任务学习

(a)硬共享模式 (b) 软共享模式 (c) 共享-私有模式

Page 49: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 49

共享模式

硬共享模式

软共享模式

共享私有模式

多级共享模式

函数共享模式

主辅共享模式

Page 50: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 50

硬共享模式

Page 51: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ICML 2008

Page 52: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

http://www.aclweb.org/anthology/N15-1092

NAACL 2015

Page 53: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ACL 2015

end to multi-end model

Page 54: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

IJCAI 2016

https://arxiv.org/pdf/1605.05101.pdf

Page 55: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1804.08139.pdf

IJCAI 2018

The infantile cart is easy to use,

Page 56: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1804.08139.pdf

IJCAI 2018

共享表示,不同注意力选择

Page 57: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

Similarity Matrix of Different Task’s query vector

IJCAI 2018

Page 58: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

IJCAI 2018

Page 59: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ICLR 2018

MULTI-TASK SEQUENCE-TO-

SEQUENCE LEARNING

Page 60: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 60

软共享模式

Page 61: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

CVPR 2016 (Spotlight)

https://arxiv.org/pdf/1604.03539.pdf

cross-stitch

Page 62: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

Sluice networks

arXiv preprint arXiv:1705.08142

Page 63: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

COLING 2018

αi−1 is updated by back-propagation, which reflects the

strength of association between tasks but has no

selection for the features.

http://www.aclweb.org/anthology/C18-1175

Page 64: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

COLING 2018

Leaky Unit

Page 65: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

COLING 2018

Multi-task and multi-lingual partially-shared modeling

http://www.aclweb.org/anthology/C18-1304

Page 66: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 66

共享-私有模式

Page 67: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://aclweb.org/anthology/D16-1012

EMNLP 2016

神经图灵机

外部记忆共享机制

Page 68: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task
Page 69: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ACL 2017

http://aclweb.org/anthology/P/P17/P17-1001.pdf

Two sharing schemes for task A and task B.

The overlap between two black circles denotes shared space.

The blue triangles and boxes represent the task-specific features

The red circles denote the features which can be shared.

The infantile cart is simple and easy to use.

This kind of humour is infantile and boring.

Page 70: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 70

对抗学习

Page 71: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 71

多任务学习数据集

Page 72: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 72

结果

Page 73: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ACL 2017 Outstanding Paper

http://aclweb.org/anthology/P/P17/P17-1110.pdf

Page 74: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ACL 2017 Outstanding Paper

http://aclweb.org/anthology/P/P17/P17-1110.pdf

Page 75: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 75

函数共享模式

Page 76: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1802.08969.pdf

AAAI 2018

元学习:学习的学习

Page 77: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

共享特征

共享函数

Page 78: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 78

多级共享模式

Page 79: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

http://anthology.aclweb.org/P16-2038

ACL short 2016

Page 80: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

EMNLP 2017

Page 81: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 81

主辅任务模式

Page 82: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

ACL 2017

https://arxiv.org/pdf/1704.07156.pdf

Page 83: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

EMNLP 2016

u indicates whether the original sentence x contains at

least one domain independent positive sentiment

word.

v indicates whether x contains at least one domain

independent negative sentiment word.

http://www.aclweb.org/anthology/D/D16/D16-1023.pdf

Page 84: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

EMNLP 2017

http://www.aclweb.org/anthology/D17-1223

Page 85: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

EMNLP 2017

http://www.aclweb.org/anthology/D17-1223

Page 86: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

LSTM for discourse argumentMulti-task Attention-based

Neural Networks

EMNLP 2018

Page 87: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 87

共享模式搜索

Page 88: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1808.07658.pdf

Page 89: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 89

面向NLP的神经网络架构搜索

Page 90: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 90

面向NLP的神经网络架构搜索

自动选择的共享模式 自动选择过程

Page 91: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 91

新的多任务基准平台

Page 92: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 92

机器阅读

阅读一篇或多篇文档,并回答一些相关问题。

Question:

Officials say the driver, 24-year-old Tavon Watson,

lost control of a _____.

Page 93: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 93

Bidirectional Attention (Seo et al.,2016)

Page 94: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1806.08730.pdf

http://decanlp.com

Page 95: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1806.08730.pdf

Page 96: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1806.08730.pdf

Page 97: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://arxiv.org/pdf/1804.07461.pdf

Page 98: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

https://gluebenchmark.com/

General Language Understanding Evaluation (GLUE) benchmark

Page 99: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 99

总结

自然语言处理简介

基于深度学习的自然语言处理

深度学习在自然语言处理中的困境 无监督预训练

多任务学习

自然语言处理中的多任务学习 硬共享模式

软共享模式

共享-私有模式

函数共享模式

多级共享模式

主辅任务模式

新的多任务基准平台

Page 100: Personal Branding Roadmap Template - GitHub Pages · 2020-07-11 · Joint Training: The training is achieved in a stochastic manner by looping over the tasks: 1. Select a random task

自然语言处理中的多任务学习邱锡鹏 (复旦大学) 100

谢 谢