22
Summarization for Dragon Star Program (Renmin Univ, Beijing, 5.21~5.27, 2012) Yueshen Xu [email protected] Zhejiang University ZJU 05/16/22

Summarization for dragon star program

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Summarization for dragon  star program

Summarization for Dragon Star Program

(Renmin Univ, Beijing, 5.21~5.27, 2012)

Yueshen [email protected]

Zhejiang University

ZJU04/10/23

Page 2: Summarization for dragon  star program

Overview

Narration What they addressed

Program Profile Knowledge and Expertise

Argumentation What I think over

Research and Research Mode Potpourri

Discussion

04/10/23 ZJU

No Dazzle

No Dazzle

Page 3: Summarization for dragon  star program

Organizer and Lecturer

Organizer Lecturer

04/10/23 ZJU

An amiable

lady

CuiPing Li

Jun He

Prof. Qiang Yang, HKUST

• Classification• Transfer Learning

Prof. Jiawei Han, UIUC

• Network Model• Relationship Mining over DBLP

Prof. Liu Huan, ASU

• Online Group Behavior over Social Network

Prof. Jian Pei, SFU

• Mining on Uncertain Data

guest appearance

Page 4: Summarization for dragon  star program

Curriculum

04/10/23 ZJU

Contents Mainly about Data Mining A little about machine learning and database

Base + Advance Base: All should know Advance: Only a few know

Syllabus  Tight and tired

Participation On time, in time and full time

Prof. Liu

6:30

Page 5: Summarization for dragon  star program

Attention

No comment, no guess, just what it’s what No topics, no transformation and no speculation No detail, just summarization Further study resource repository

http://www.cse.ust.hk/~qyang/2012DStar/ http://www.cs.uiuc.edu/~hanj/dragon12/info12.htm Ask for me Ask for me all is OK

04/10/23 ZJU

• What they told me are summarization• Digest not too much• Learn it for needing it

• What you research is to what you meet.

• No qualification

Page 6: Summarization for dragon  star program

Prof. Yang Classification & Transfer Learning

Classification Decision Trees Neural Networks

Replaced by SVM Bayesian Classifiers

Conditional Independence Naïve Bayesian Network

Support Vector Machines Little about why, mainly about what

Ensemble Classifiers Bagging and Boost (Ada boost) Random Forest

Collaborative Filtering A little

04/10/23 ZJU

Prof. Yang, can you speak a little faster?

Just Summarization, little detail

Page 7: Summarization for dragon  star program

Prof. Yang Classification & Transfer Learning

Transfer Learning What he and his students good at and maybe only good at

04/10/23 ZJU

Page 8: Summarization for dragon  star program

Prof. Yang Classification & Transfer Learning

I don’t know, but I can bamboozle you Transfer Learning

The ability of a system to recognize and apply knowledge and skills learned in previous tasks to novel tasks or new domains

Easy to talk, hard to do

04/10/23 ZJU

Page 9: Summarization for dragon  star program

Prof. Yang Classification & Transfer Learning

What they focus on Heterogeneous Transfer Learning Source-free selection transfer learning Multi-task transfer learning Transfer Learning for Link Prediction EigenTransfer: A Unified Framework for Transfer Learning

04/10/23 ZJU

Page 10: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

An amiable and rigorous old senior He is involved in the whole process of each paper, ‘Cause he knows

details well He would like to answer every questions Never acting superior

Information Network Model: Great powers of conception Fundamental theory of network analysis Not just about social network. Take a glance at Prof. Han’s contents:

─ Network Science

─ Measure of Metrics of Networks

─ Models of Network Formation

04/10/23 ZJU

Page 11: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

Network Science Plentiful Social network

Social network example Friendship networks vs. blogosphere

Other Network Communication Network Biological Network

04/10/23 ZJU

Network model and their representation

Too many, just list some:

• PageRank, Bipartite Networks

Models of Network Formation Explain how social networks

should be organized Model the graph generation

process of social networks Probabilistic Distribution Power Law Long tail law The Erdös-Rényi (ER) Model The Watts and Strogatz Model

Page 12: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

All based on DBLP Why? ‘Cause it’s heterogeneous networks Clustering, Ranking in information networks

Problems What they mine

04/10/23 ZJU

Page 13: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

Classification of information networks Is VLDB a conference belonging to DB or DM?

Similarity Search in information networks DBLP

Who are the most similar to “Christos Faloutsos”? IMDB

Which movies are the most similar to “Little Miss Sunshine”? E-Commerce

Which products are the most similar to “Kindle”?

Y. Sun, J. Han, X. Yan, P. S. Yu, and Tianyi Wu, “PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks”, VLDB'11

04/10/23 ZJU

Page 14: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

What they take advantage of? Network Schema, called Meta-Path, take an example:

04/10/23 ZJU

Page 15: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

Relationship Prediction in Information Networks Whom should I collaborate with? Which paper should I cite for this topic? Whom else should I follow on Twitter? Y.Sun, R.Barber, M.Gupta, C.Aggarwal and J.Han. “Co-author Relationship

Prediction in Hererogeneous Bibliographic Networks”, ASONAM’11, July 2011

Role Discovery: Extraction Semantic Information from Links

Ref. C. Wang, J. Han, et al., “Mining Advisor-Advisee Relationships from Research Publication Networks”, SIGKDD 2010

Data Cleaning and Trust Analysis by InfoNet Analysis Xiaoxin Yin, Jiawei Han, Philip S. Yu, “Truth Discovery with Multiple Conflicting

Information Providers on the Web”, TKDE’08

04/10/23 ZJU

Page 16: Summarization for dragon  star program

Prof. Han Information Network Model & Relationship Mining over DBLP

Automatic discovery of Entity Pages (T. Weinger, Jiawei Han et al. WWW’11) Given a reference page, can we find entity pages of the same

Type?

14 pages references

04/10/23 ZJU

Page 17: Summarization for dragon  star program

Prof. Pei Uncertain Data Mining Mining uncertain data Probability is vital

Models and Representation of uncertain data Mining Frequent Patterns Classification Clustering Outlier Detection

Topic-Oriented Nothing to do with database, namely nothing to do with query Learn yourself Outlier Detection on uncertain data is a challenge This is what I most concern about from point view of knowledge

04/10/23 ZJU

Page 18: Summarization for dragon  star program

Our Thoughts

As for pure research, there is no speculation What’s the proper mode for research?

Method-Oriented: Prof. Yang

All about transfer learning

All I have to do is solve practical problems with transfer learning, eg. Link predication.

Application-Oriented: Prof. Han

Find fun in DBLP, all about relationship mining

Every part of Prof. Han’s method is not new, but leading by the problem, the whole framework is innovative

Topic-Oriented: Prof. Pei

Clustering and outlier detection on uncertain data

He and his team is dependent on solid accumulation

04/10/23 ZJU

Page 19: Summarization for dragon  star program

Our Thoughts

How do they do research? Accumulation Real world problem Valuable research problem

Discuss and test to find a suitable method Experiment Paper

Accumulated by means of imitation Not just scan ppt, but do experiments others had did Solve problems others had solved

Different field, different mode Application-Oriented: flexible Method-Oriented: mathematics Topic-Oriented: accumulation

Work as a Team

04/10/23 ZJU

Is the problem valuable? Can it be solved by us?

Test again and again. Accumulation, experience, judgment….

Experience and hard work

Revise many times

Page 20: Summarization for dragon  star program

Our Thoughts

Prof. Pei: Small data Can you learn a model just with a little data? Data collection is very costly Since you can know what you want using 1GB, why do you use

1TB with so many machines? Prof. Pei: do we really need experiments? No, provided that what

you have done is really convictive./ Yes, ‘cause our job is not convictive enough.

Read every helpful paper Research should be labeled by researchers, their teams

and their labs. Everyone has his own pan, not that all guys just have one.

04/10/23 ZJU

Page 21: Summarization for dragon  star program

Our Thoughts

20/80 Law I have fallen behind from others I had lost myself in clouds of research for one year. I

hope I can find my way.

04/10/23 ZJU

Page 22: Summarization for dragon  star program

DiscussionDiscussion

04/10/23 ZJU