Lab Seminars (实验室研讨会) - Renmin University of China · idke.ruc.edu.cn/reports/report2014/Appendix/Appendix.pdf · 2018-03-21

Appendix

Lab Seminars

2014.12.30 Streaming Processing & Short Text Processing

Yantao Gan

(Cloud Group)

Whence and Whither streaming -- Introduction to Messaging System

Abstract:

It has become clear that real-time query processing and stream processing are
immediate needs in many practical applications. Messaging systems play an
important role in ingesting large amounts of stream data from many different
sources into a stream processing system such as Storm. In this report, we
introduce some popular data ingestion tools, including Flume, Scribe, Sqoop,
Chukwa, RabbitMQ, Kafka and Spring XD.

Hehan Li

(Web Group)

Semantic Hierarchy Learning and Paraphrase Dictionary Building over

Knowledge Base

Abstract:

Semantic hierarchy learning is also known as hypernym-hyponym relation
detection. We combine a method for learning semantic hierarchies with the
needs of our recent work and propose a new method for building a paraphrase
dictionary for a knowledge base.

2014.12.23 Clickstream Network & Entity Linking

Li Yong

(Web Group)

Scaling Behaviors of Weighted Clickstream Network

Abstract:

The availability of big data, such as human online surfing records, makes it
possible to probe into and quantify the regular patterns of users' long-range,
complex interactions between Web sites. We construct a clickstream network
whose nodes are websites and whose edges are formed by users switching between
sites. By analyzing the circulation of collective attention, we discover a
scaling relationship between the impact of sites and their traffic.

Kejun Zhao

(Web Group)

Entity Linking with a Knowledge Base

Abstract:

Plenty of data on the Web is in the form of natural language. Bridging Web

data with knowledge bases is beneficial for many applications. This seminar

talks about the key issues, techniques and solutions of entity linking with a

knowledge base and gives a few ideas on future work.

2014.12.16 Flash Caching

Jiangtao Wang

(Cloud Group)

Enhancing the Performance of Database Applications with Flash

Caching

Abstract:

SSDs and HDDs exhibit different retrieval costs. HDDs are cost-effective for
infrequently accessed data, and SSDs are well suited to data that is
relatively hot. Using flash memory as an extended cache can reduce the
performance gap between DRAM and HDD. We discuss how to use flash to
accelerate the performance of database systems.

Zhiliang Guo

(Cloud Group)

Enterprise flash - the development and applications.

Abstract:

Enterprise flash devices are more durable and offer higher performance than
consumer flash. They also offer good write performance and delay-jitter
behavior. In this report, I introduce the development of enterprise flash
devices and two of the latest products.

2014.12.09 Web Data Management

Wen Mei Wu

(Cloud Group)

VLDB2014 Overview

Abstract:

This report gives an overview of VLDB2014, covering its three keynotes and the
research papers. It also introduces two papers from the conference. One
presents several sort and join algorithms for persistent memory; the other
considers storage management in the NVRAM era.

Wang Lu

(Web Group)

Uniqueness privacy preserving in location data publication

Abstract:

During location data publication, the uniqueness of individual records may
reveal sensitive information, such as personal profiles or political
affiliation, to attackers. In this paper, we investigate the uniqueness issue
in location data and propose a privacy-preserving solution that protects
users' sensitive information.

Zhijuan Du

(Web Group)

Plan Bouquets: Query Processing without Selectivity Estimation

Abstract:

The article proposes a new approach to the selectivity estimation problem for

OLAP queries, wherein the compile-time estimation process is completely

eschewed for error-prone selectivities. Instead, a small “bouquet” of plans is

identified from the set of optimal plans in the query’s selectivity error space.

Then, at run time, the actual selectivities of the query are incrementally

“discovered” through a sequence of partial executions of bouquet plans,

eventually identifying the appropriate bouquet plan to execute. The duration

and switching of the partial executions is controlled by a graded progression

of isocost surfaces projected onto the optimal performance profile.
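
The run-time discovery loop described in this abstract can be illustrated with a minimal one-dimensional sketch. It is a simplification under stated assumptions, not the article's implementation: there is a single error-prone selectivity, every plan's cost grows with it, and the names `Plan`, `build_bouquet` and `run_bouquet` and the two cost formulas used in the usage note are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class Plan:
    name: str
    cost: Callable[[float], float]  # hypothetical cost model: selectivity -> cost


def optimal_plan(plans: Sequence[Plan], s: float) -> Plan:
    """Cheapest plan at selectivity s."""
    return min(plans, key=lambda p: p.cost(s))


def build_bouquet(plans: Sequence[Plan], grid: Sequence[float],
                  ratio: float = 2.0) -> List[Tuple[float, Plan]]:
    """Compile time: pick one plan per isocost budget.

    Budgets grow geometrically from the cheapest to the most expensive optimal
    cost over `grid` (an ascending list of candidate selectivities).
    Assumes the optimal cost does not decrease as selectivity grows.
    """
    c_min = optimal_plan(plans, grid[0]).cost(grid[0])
    c_max = optimal_plan(plans, grid[-1]).cost(grid[-1])
    bouquet = []
    budget = c_min
    while True:
        # Largest selectivity whose optimal cost still fits in this budget.
        s_k = max(s for s in grid if optimal_plan(plans, s).cost(s) <= budget)
        bouquet.append((budget, optimal_plan(plans, s_k)))
        if budget >= c_max:
            break
        budget = min(budget * ratio, c_max)
    return bouquet


def run_bouquet(bouquet: Sequence[Tuple[float, Plan]],
                actual_s: float) -> Tuple[str, float]:
    """Run time: try each bouquet plan under its budget until one completes.

    Returns the name of the completing plan and the total cost spent,
    including the budgets burned by aborted partial executions.
    """
    spent = 0.0
    for budget, plan in bouquet:
        needed = plan.cost(actual_s)
        if needed <= budget:
            return plan.name, spent + needed
        spent += budget  # partial execution aborted once the budget ran out
    raise RuntimeError("actual selectivity lies outside the modeled space")
```

For example, with two made-up plans, `Plan("index_nl", lambda s: 10 + 1000 * s)` and `Plan("hash", lambda s: 100 + 100 * s)`, and a grid of selectivities 0.01, 0.02, ..., 1.0, the bouquet holds budgets 20, 40, 80, 160 and 200. A query whose true selectivity is 0.5 burns the first three budgets on the index plan and then completes with the hash plan under the fourth.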

2014.12.06 R-Store & Redis

Yanjun Fu

(Cloud Group)

R-Store: A Scalable Distributed System for Supporting Real-time

Analytics

Abstract:

It is widely recognized that OLTP and OLAP queries have different data

access patterns, processing needs and requirements. Hence, the OLTP queries

and OLAP queries are typically handled by two different systems, and the

data are periodically extracted from the OLTP system, transformed and

loaded into the OLAP system for data analysis. With the awareness of the

ability of big data in providing enterprises useful insights from vast amounts

of data, effective and timely decisions derived from real-time analytics are

important. It is therefore desirable to provide real-time OLAP querying

support, where OLAP queries read the latest data while OLTP queries create

the new versions.

Zhiliang Guo

(Cloud Group)

Introduction of design principles and applications of Redis.

Abstract:

Redis is an open source, BSD licensed, advanced key-value cache and store.

It is often referred to as a data structure server since keys can contain strings,

hashes, lists, sets, sorted sets, bitmaps and hyperloglogs.

2014.11.29 Query Understanding

Hehan Li

(Web Group)

Query Understanding over Knowledge Base

Abstract:

With the growing popularity of knowledge bases, how to query them more
efficiently and accurately has become a hot topic. The task faces three main
challenges: (1) ambiguity, (2) coverage, and (3) scale. This report introduces
both keyword queries and natural language queries, and summarizes and compares
the methods for these two kinds of query.

2014.11.18 Stream Partitioning & Data Processing to Smart SSDs

Chunkai Wang

(Cloud Group)

Adaptive Stream Partitioning Based on Temporal Positive Correlation

Abstract:

We propose a method of adaptive stream partitioning based on temporal positive
correlation. First, we derive the maximal partition set from the user's query
at compile time. Second, we merge partition keys by computing their temporal
positive correlation at run time. Finally, we introduce dynamic partitioning
based on grid density to improve the robustness of the method.

Jiangtao Wang

(Cloud Group)

Offloading Data Processing to Smart SSDs

Abstract:

Solid state drives (SSDs) have emerged as a new kind of secondary storage
medium. Data transfer is expensive when handling large-scale data sets, so
moving code to data is far more efficient than moving data to code. As the
computing capability of SSDs grows, we discuss how to use smart SSDs to
accelerate data-intensive applications.

2014.11.11 Stream Partitioning & Private Information Retrieval

Chunkai Wang

(Cloud Group)

Research on Data Stream Partitioning Strategies

Abstract:

On a distributed processing platform, stream data must be partitioned
according to the user's query to improve the system's processing speed. We
introduce three kinds of partitioning strategies: query-aware partitioning,
TAD-based partitioning, and correlation-aware partitioning. Finally, we
summarize the pros and cons of the three strategies.

Wang Lu

(Web Group)

Technology of Private Information Retrieval

Abstract:

This talk introduces methods that offer strong location privacy, by integrating

private information retrieval functionality.

2014.11.02 Big Data mining & SK Query

Du Zhi Juan

(Web Group)

A Survey on Big Data mining in Microblog

Abstract:

First, we analyze the content and background characteristics of microblogs and
derive the 1H-2S-3M-4V characteristics of microblog data. Second, we review
related research from the perspectives of social attribute mining and content
mining. Finally, we explore the challenges of mining microblog data and the
new problems arising from social demands, in light of these 10 characteristics
of microblog data.

Shengna Guo

(Mobile Group)

A survey of Spatial-Keyword (SK) Query

Abstract:

Geo-textual indices play an important role in spatial keyword querying. The

existing geo-textual indices have not been compared systematically under the

same experimental framework. This makes it difficult to determine which

indexing technique best supports specific functionality.

2014.10.28 Web Data Management

Li Yong

(Web Group)

[...] they are often driven by interests. Despite recent efforts in exploring
behavioral targeting and Web user interest mining, little is known about the
regular patterns of the human-interest process. The availability of big data,
such as human online surfing records, e-commerce records and communication
records, makes it possible to probe into and quantify the dynamics of
human-interest behaviors. These data are called “small data” in the era of big
data. In this presentation, we introduce some new thoughts on mining online
behavior from these small data.

Kejun Zhao

(Web Group)

Short text understanding

Abstract:

Short text understanding is a hot yet challenging task. Unlike traditional
full text, short text is often considered to lack syntactic features and
context, so traditional approaches, including parsing, chunking, entity
recognition and disambiguation, are mostly not applicable. This seminar
introduces the problems, challenges and main techniques of short text
understanding, as well as some popular machine learning models.

2014.10.21 Latest Exchange Report

Yu Zhang

(Cloud Group)

Report of WI2014

Abstract:

The 2014 IEEE/WIC/ACM International Conference on Web Intelligence (WI2014)
was held in Warsaw, Poland, on August 11-14, 2014. It received 242 paper
submissions for the research track. The research program features 85 papers,
and the acceptance rate for regular papers is 35.1%. The conference program
also includes 8 keynotes, 7 tutorials and 4 panels.

Li Yong

(Web Group)

Introduction of 2014 massive data seminar in Hong Kong

Abstract:

To further promote research cooperation between the Chinese mainland and Hong
Kong, the National Natural Science Foundation of China (NSFC) and the Chinese
University of Hong Kong (CUHK) jointly organized a symposium in Hong Kong on
September 23-24, 2014. The theme of the symposium was massive data management.
In this presentation, we introduce some new ideas about big data from the
symposium.

Chunkai Wang

(Cloud Group)

2014 Academic symposium on Big Data

Abstract:

This talk introduces several sessions of the academic symposium, focusing on
the special report "One-Pass AUC Optimization". Finally, it presents a photo
show.

Lu Wang

(Mobile Group)

Report on a Visit to Hong Kong Baptist University

Abstract:

The report focuses on research progress and experience at Hong Kong Baptist
University.

2014.10.18 Big Data Management: Data Skew and Bootstrap

Xiang Ci

(Cloud Group)

Data Skew in MapReduce-based Systems

Abstract:

Data skew is inevitable in distributed systems. In this talk, I will introduce
the definition of data skew, the types of data skew and the basic approaches
to handling it. I will also summarize recent papers related to this topic.
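
To make the notion of skew concrete, here is a minimal sketch that is not taken from the talk. It hash-partitions keys across reducers, measures skew as the ratio of the heaviest reducer's load to the mean load, and applies key salting, one common mitigation. The function names, the `#` salt separator and the use of CRC32 as the partitioning hash are assumptions made for illustration.

```python
import zlib
from collections import Counter
from typing import AbstractSet, List, Sequence


def partition(key: str, n_reducers: int) -> int:
    """Deterministic hash partitioner (CRC32 stands in for a real system's hash)."""
    return zlib.crc32(key.encode("utf-8")) % n_reducers


def reducer_loads(records: Sequence[str], n_reducers: int,
                  hot_keys: AbstractSet[str] = frozenset(),
                  salt_buckets: int = 1) -> List[int]:
    """Count how many records each reducer receives.

    Records whose key is in `hot_keys` get a salt suffix, so one hot key is
    spread over up to `salt_buckets` distinct partition keys.
    """
    load = Counter()
    for i, key in enumerate(records):
        if key in hot_keys:
            key = f"{key}#{i % salt_buckets}"
        load[partition(key, n_reducers)] += 1
    return [load[r] for r in range(n_reducers)]


def skew(load: Sequence[int]) -> float:
    """Heaviest reducer's load divided by the mean load (1.0 means balanced)."""
    return max(load) / (sum(load) / len(load))
```

With 900 records for one hot key and 100 records with distinct keys on 4 reducers, the unsalted skew is at least 3.6, because one reducer receives all 900 hot records. Salting the hot key lowers it, at the price of a second aggregation step that merges the partial results of the salted keys.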

Fengming Wang

(Cloud Group)

Bootstrap: a simulation-based statistical method

Abstract:

Statistical inference is difficult. One difficulty is that in practice,
outside of certain properties like the mean, it can be extremely difficult to
infer characteristics of a distribution. Motivated by these sorts of problems
and facilitated by the advent of inexpensive computational power, the last 25
years have seen the widespread acceptance of experimental or simulation-based
confidence bounds in statistics.
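
A simulation-based confidence bound of the kind mentioned above can be sketched with the percentile bootstrap. This is a minimal illustration rather than the talk's material: the function name, the default of 2000 resamples and the choice of the median as the statistic are assumptions.

```python
import random
import statistics
from typing import Callable, Sequence, Tuple


def bootstrap_ci(sample: Sequence[float],
                 stat: Callable[[Sequence[float]], float] = statistics.median,
                 n_resamples: int = 2000,
                 alpha: float = 0.05,
                 seed: int = 0) -> Tuple[float, float]:
    """Percentile-bootstrap confidence interval for `stat` of the population.

    Resamples the data with replacement, recomputes the statistic on each
    resample, and reads the interval off the empirical quantiles.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    n = len(sample)
    estimates = sorted(
        stat(rng.choices(sample, k=n)) for _ in range(n_resamples)
    )
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper
```

Calling `bootstrap_ci(data)` returns an approximate 95% interval for the median, a statistic whose sampling distribution has no convenient closed form in general. That is the situation the abstract describes.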

2014.05.30 Storage Management

Jiangtao Wang

(Flash Group)

Impact of SSD on Different Workload Application

Abstract:

Solid state drives (SSDs) have emerged as a new kind of secondary storage
medium. Many applications have completely replaced HDDs with SSDs as their
main storage. This raises an interesting research question: “What is the
impact of SSDs on applications with different workloads?” In this
presentation, we introduce two pieces of work, on multi-tenancy OLTP
applications and on search engines, and discuss the key factors in designing
SSD-based storage systems.

Wenyu Lai

(Flash Group)

Implementation Technologies of Storage Management in PostgreSQL

Abstract:

PostgreSQL is an advanced object-relational database system. In this talk, we
briefly introduce the implementation technologies of storage management in
PostgreSQL.

2014.05.24 Deep Learning and Privacy on Big Data

Wang Lu

(Web Group)

Exploiting service similarity for privacy in location-based search queries

Abstract:

This paper proposes a user-centric location-based service architecture where

a user can observe the impact of location inaccuracy on the service accuracy

before deciding the geo-coordinates to use in a query. It also constructs a

local search application based on this architecture.

2014.05.16 Short Text Understanding

Zhao Kejun

(Web Group)

Short Text Understanding

Abstract:

Natural language processing has always been a hot topic, and the semantics of
text in particular has received much attention. Short text, because of its
lack of syntax and context, makes understanding more challenging. This seminar
discusses some recent work in this area and presents some interesting topics.

2014.04.11 Web Data Management

Gan Yantao

(Cloud Group)

Report of ICDE2013

Abstract:

The 29th IEEE International Conference on Data Engineering (ICDE2013)

was held in Brisbane, QLD, Australia, April 8-11, 2013. ICDE2013 received

443 paper submissions for the research track, 20 submissions for the

industrial track, and 69 demo proposals. The research program features 95

papers, the industrial program 8 papers, and the demonstration program 27

demos. The conference program also includes 3 keynotes, 9 seminar tutorials

and one panel.

Wang Lu

(Web Group)

Utility Centric Sensitive Data Publication via Partition

Abstract:

Most work in privacy-aware data sharing has considered disclosing summaries
where the aggregate information about the data is preserved. We consider a new
data sharing paradigm and introduce the problem of privacy-aware data
partitioning. The data should be distributed so that an adversary, without
colluding with other adversaries, cannot draw additional inferences about the
private information.

Miao Wang

(Web Group)

HTML5 Head First

Abstract:

This talk introduces the background of HTML5 and some of its interesting attributes.

2014.04.04 Data management on New Storage

Jiangtao Wang

(Flash Group)

Cost-aware data management for Phase Change Memory

Abstract:

Storage systems based on Phase Change Memory (PCM) devices are

beginning to generate considerable attention in both industry and academic

communities. PCM has high potential as a new component for enterprise

storage systems in a multi-tiered environment. Our presentation describes an

evaluation of phase change memory for enterprise storage systems.

Zhiliang Guo

(Flash Group)

An Introduction to OceanBase

Abstract:

OceanBase is a scalable relational database system designed by Alibaba Group.
This report introduces the architecture and design techniques of OceanBase.

2014.03.28 Introduction to RDF Storage and Query

Hehan Li

(Cloud Group)

Introduction to RDF Storage and Query

Abstract:

RDF (Resource Description Framework) is a general framework for describing
resources that promotes automated processing of Web data, and it has been
widely used in recent years. Although its triple structure is easy to
understand, the study of RDF raises many problems, such as how to organize RDF
data in a relational database and how to design retrieval algorithms over RDF
graphs. These have recently become hot issues. This report gives a brief
introduction to the background of RDF and to the organization and querying of
relational-database-based, triple-based and graph-based RDF stores.

2014.03.14 Web Data Management

Li Yong

(Web Group)

User's online Behavior Data Mining

Abstract:

Users' online behavior information plays an important role in personalized web
applications. However, it is usually not easy to obtain this kind of personal
behavior data. In this presentation, we introduce two algorithms that predict
users' demographic attributes from their online browsing behaviors.

Ruxia Ma

(Web Group)

Truth Discovery on Deep Web

Abstract:

Information available on the Web is abundant but often inaccurate. Different
information sources publish information with different degrees of correctness.
For a novice user, it is not easy to identify incorrect data, so finding the
truth among conflicting facts is an important issue. Recently there has been a
lot of work on evaluating the truthfulness of facts and of their sources.
Here, we analyze the work on truth discovery on the Deep Web.

2014.03.07 Data Stream Processing & Extreme Value Theory

Chunkai Wang

(Cloud Group)

Data Stream Processing Languages

Abstract:

With the rise of data stream applications, data stream processing languages
targeting different platforms have emerged. This report introduces and
compares four processing languages: Stanford-CQL, IBM-SPL,
StreamBase-StreamSQL and DBT-SQL. Finally, it describes the architecture and
challenges of PQSAL, which is being developed in our lab.

Xiang Ci

(Cloud Group)

An Introduction to Extreme Value Theory

Abstract:

Traditional statistics focuses on the majority of the data, but in many
applications the long tail of a dataset may be much more valuable. Extreme
value theory is concerned with modelling the tail of a dataset and analyzing
the features of these extreme values. This report gives a simple introduction
to extreme value theory.
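
As one concrete instance of tail modelling, the sketch below implements the Hill estimator, a standard extreme-value tool for heavy-tailed data. The abstract does not say which estimators the talk covered, so this choice is an assumption, as are the function name and the parameter `k` (the number of upper order statistics used).

```python
import math
from typing import Sequence


def hill_estimator(data: Sequence[float], k: int) -> float:
    """Hill estimate of the extreme value index gamma (tail index = 1 / gamma).

    Uses the k largest observations relative to the (k+1)-th largest one:
        gamma_hat = (1 / k) * sum_{i=1..k} ln(X_(i) / X_(k+1)).
    """
    if not 0 < k < len(data):
        raise ValueError("k must satisfy 0 < k < len(data)")
    top = sorted(data, reverse=True)[: k + 1]
    threshold = top[k]  # the (k+1)-th largest value
    if threshold <= 0:
        raise ValueError("the k+1 largest values must be positive")
    return sum(math.log(x / threshold) for x in top[:k]) / k
```

For data drawn from a Pareto distribution with shape 2, the estimate should be close to 0.5. In practice the result is sensitive to `k`, which is why it is usually plotted over a range of `k` values rather than read at a single point.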

2014.01.10 System research report: GraphLab, Spark, Pregel & Hama

Xiaojian Zhang

(Web Group)

GraphLab + Spark

Abstract:

This report introduces GraphLab and Spark in terms of system architecture,
functional modules, implementation, etc. In addition, we analyze them in
comparison with other similar systems.

Ruxia Ma

(Web Group)

Large-Scale Graph Processing Systems: Pregel and Hama

Abstract:

With the coming of the big data age, many practical computing problems concern
large graphs. Graph processing technologies have been developed over a long
time, but with the development of information technology and the explosion of
information, the scale of graphs keeps growing, and efficient processing of
large graphs is challenging. In this report, we introduce two systems for
large-scale graph processing: Pregel and Hama.

2014.01.03 Protecting uniqueness in human mobility

Lu Wang

(Web Group)

Protecting uniqueness in human mobility

Abstract:

Because even coarse datasets provide little anonymity, new protection
frameworks need to be designed to protect the privacy of individuals.

Lab Website

http://idke.ruc.edu.cn/

Lab Members

Faculty Members

Meng Xiaofeng

孟小峰

Ph.D., Professor, Doctoral Supervisor

Head of the WAMDM Lab

Cao Wei

曹巍

Ph.D., Lecturer

Ph.D. Candidates

Ma Ruxia

马如霞

Ci Xiang

慈祥

Wang Jiangtao

王江涛

Wang Lu

王璐

Li Yong

李勇

Wang Chunkai

王春凯

Du Zhijuan

杜治娟

Wu Wenmei

吴文妹

M.Sc. Students

Zhang Yu

张瑜

Gan Yantao

干艳桃

Guo Zhiliang

郭志良

Wang Fengming

汪凤鸣

Zhao Kejun

赵可君

Li Hehan

李和瀚

Guo Shengna

郭胜娜

Fu Yanjun

符彦军

------------------------------------------------------------------------------------------------------

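Reflections from New Lab Members

Du Zhijuan (杜治娟), Ph.D. student, entering class of 2014

As a student, I feel supremely honored to study at Renmin University of China, a historic institution that has shared the fate of the Party and the nation. I feel privileged to study under Professor Meng, and meeting my fellow students in the WAMDM lab here is a matter of good fortune. Let us encourage one another and move forward hand in hand in the years to come.

This is a place full of hidden talent, with a strong academic atmosphere and a fine academic tradition that have produced generation after generation of outstanding people. As the ancient poem goes: "To see a thousand miles farther, climb one more storey." From microblog data mining and spatial data to Web data integration and big data integration, the research my advisor has guided reflects his distinctive teaching methods and philosophy. Under his careful guidance, I am sure I will go one step further.

I wish us all an even better tomorrow!

Wu Wenmei (吴文妹), Ph.D. student, entering class of 2014

People say that Renmin University of China is the birthplace of database research in China and home to many of the field's top scholars, and I had long wanted to see it for myself. Now that dream has come true and I am studying at Renmin University. I have been truly struck by the faculty's rigorous, truth-seeking and innovative spirit, and impressed by the learning and drive of my fellow students. I have also become aware of my own shortcomings and of how far I still have to go, and all of this spurs me on. I will hold myself to a high standard and build my doctoral career one step at a time. I believe this will be a very special journey in my life!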

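Fu Yanjun (符彦军), M.Sc. student, entering class of 2014

I remember that in college my favorite thing to do was to gaze at the stars every night and dream about a wonderful future. Looking back, I had so much time then and was so easily satisfied: I could be happy for days after saving up a little living money to buy a racket I loved, and thrilled at fulfilling some small, easily reached wish. The stars are still the same, but I have become so hurried that I am reluctant even to spend time looking up at them. Since joining the lab, the environment has made me, if not mature, at least much more practical. I know there is still a star in the sky watching over me, but I understand better now that a star cannot be grasped by hand; I have to turn what I learn into tools and keep moving closer to it. For my two years of graduate study: keep going, keep going, keep going!

Guo Shengna (郭胜娜), M.Sc. student, entering class of 2014

After a year of hard work, I finally got into the university of my dreams. In just a few months at Renmin University, I have felt the rich cultural atmosphere and rigorous scholarship of a top school. In the lab, I have seen first-hand Professor Meng's deep academic expertise, his original insight into problems and his sure grasp of frontier directions, as well as the high-quality papers and outstanding results of the senior students. With everyone's help, I have gained a deeper understanding of graduate life and a clearer grasp of my research direction.

Past successes and failures are behind me. At this new starting point as a graduate student, the future can only be broader! Over the next three years, I want to learn to know myself through reflection, to seek truth through study and to earn value through practice; to pursue my dreams with all the passion and ability I have; and to ride the waves of ever fiercer competition toward a more brilliant tomorrow!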

-------------------------------------------------------------------------------------------------------

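Messages from Graduates

Ma Youzhong (马友忠), Ph.D. graduate of 2014, now at Luoyang Normal University

In the blink of an eye, several months have passed since I left the WAMDM lab, yet those years remain vivid in my mind. Although I am now in Luoyang, far from Beijing, I still follow everything that happens in the lab. Every day when I turn on my computer, I find myself typing that familiar, dear address into the browser, eager to see the lab's latest progress and learn of every change.

In September 2010, I came to the WAMDM lab hoping to better myself; in July 2014, I graduated and left with great reluctance. In those four full years, I went through a lot, changed a lot and gained a lot.

I remember that my first year in the lab was hard. I could not find a breakthrough in my research, I made a mess of the lab's routine tasks, and my child had just been born, so with everything piling up I could hardly breathe. With the help of my labmates, and after a period of reflection and adjustment, I gradually calmed down and found my own rhythm of life and study. From then on I read papers diligently and worked seriously to fill the gaps in my fundamentals; it was then that I truly understood what it means to "forget to eat and sleep". After another year of effort, I published my first paper since enrolling. Looking at the brief acceptance notice, I told myself that I had completed my first transformation in the WAMDM lab.

Beyond research, the lab has many routine activities and tasks to handle. At first, under the pressure of research and with little experience of such matters, I did many small things badly and was often criticized. Later I realized that WAMDM is a platform: besides training my research ability, I should make full use of everything here to exercise and improve myself. From then on, I took every task assigned to me seriously and tried hard to do it well. After every academic conference the lab hosted, I would catch a cold right "on schedule", but seeing the conference end successfully left me content. Gradually I stopped being afraid of things; whether or not I had done something before, I could face it calmly and quickly sort out how to approach the problem. At that point, I told myself that in the WAMDM lab I had completed [...]

Experience is itself a kind of wealth, and those four years have left a deep mark on me. Although I have left the lab, I still keep many of the habits I formed there. I thank the WAMDM lab for giving me an environment in which to grow, and Professor Meng for his careful guidance.

Finally, I sincerely wish the WAMDM lab an even brighter tomorrow!

Luoyang Normal University, November 20, 2014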

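Huo Jiangtao (霍江涛), M.Sc. graduate of 2014, now at Bank of China

Memories are like pearls strung together on a thread of longing; bit by bit, they are the treasures of life most worth savoring. My three years in the lab gave me riches I will draw on for the rest of my life. Here I went through the most important stage of my life: my graduate studies. I arrived a little naive and a little reckless, with some hope and some longing, and today I have successfully taken the first step in my working life.

I still remember the annual reports we put together. Led by Professor Meng, each was the joint work of all the brothers and sisters in the lab and gathered the lab's intellectual output for the year. Papers made up most of the report and were also its most important part. Opening this slim yet weighty record and looking at page after page of dense text, you can almost see Professor Meng's careful guidance at every lab meeting and the warm concern of the senior students in every group discussion. Research may be dry work, but when, after countless nights of hard thinking, you finally find a new solution, that excitement is something no one could ever forget.

Professor Meng once said that the lab is a great ship that carries everyone to the harbor of their ideals. Indeed, here there is a rich academic atmosphere, cutting-edge scientific knowledge, many opportunities to exchange ideas with well-known scholars at home and abroad, and many research projects that are excellent training. Here you can roam freely in the field you know best and improve quickly in fields you barely understand. As long as you are willing, you can always find the direction you most want to commit to, until you gradually wash away your former vanity and greenness and your life slowly fills with solid fruit, so that wherever and whenever we find ourselves, we can face things calmly and without fear.

Of course, the activities the lab organized in our spare time are also among my fondest memories. When I close my eyes, scenes from those days tug at my heart: the fun of playing table tennis with the senior students, the joy of playing badminton with the junior ones, and, most exciting of all, the trip with everyone to Chengde to visit the Mountain Resort and the Little Potala Palace. The weather was not very clear, but the misty drizzle washed away the summer heat and dust and left us feeling cool and refreshed.

Professor Meng says that excellence is a habit. These words have influenced me deeply: your attitude determines the life you have. Since I started working, I have realized more and more that there is no longer anyone like Professor Meng standing behind me and urging me on at every moment; everything now depends on my own effort and self-discipline. Only by keeping these words in mind, never losing the will to advance and never sparing effort on the road of my career, can I achieve something.

Ten years of toil through cold and heat are rewarded with rich fruits of research. With Professor Meng at the helm and through the efforts of the senior students, the lab has achieved a great deal. I am confident that the junior students will take up the lab's baton and achieve even more in their research, and I sincerely wish our lab ever greater success!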

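Recent News of Lab Graduates

Destinations of 2014 Graduates

Name                    Degree  Date           Destination
Fu Lizhen (富丽贞)       Ph.D.   December 2013  North University of China
Ma Youzhong (马友忠)     Ph.D.   July 2014      Luoyang Normal University
Zhang Xiaojian (张啸剑)  Ph.D.   July 2014      Henan University of Economics and Law
Wang Miao (王淼)         M.Sc.   July 2014      China Construction Bank, Beijing Branch
Liu Chunqiu (刘春秋)     M.Sc.   July 2014      China Insurance Information Technology Management Co., Ltd.
Huo Jiangtao (霍江涛)    M.Sc.   July 2014      Bank of China Data Center
Han Xu (韩旭)            M.Sc.   July 2014      Microsoft (China) Co., Ltd.
Lai Wenyu (赖文豫)       M.Sc.   July 2014      Taobao (China) Software Co., Ltd.

Career development of lab alumni:

Xie Min (谢敏): data scientist at WalmartLabs.

(Xie Min graduated from the lab in July 2008 with a Master of Engineering degree.)

Project funding awarded to lab alumni:

Zhou Junfeng (周军锋): "XML Keyword Query Processing Techniques Based on Prior Constraints", National Natural Science Foundation of China project, project code F020204. (Zhou Junfeng graduated from the lab in July 2009 with a Doctor of Engineering degree.)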

Papers published by lab alumni:

Xiao Pan, Lei Wu, Zhaojun Hu, Zheng Huo: Voronoi-Based Spatial Cloaking Algorithm over Road Network. DEXA (2) 2014: 273-280. (Pan Xiao graduated from the lab in July [...] with a Doctor of Engineering degree.)

Xiao Pan, Lei Wu, Chunhui Piao, Xiaoshuo Xu: P3RN: Personalized Privacy Protection Using Query Semantics over Road Networks. WAIM 2014: 323-335

Zhongyuan Wang, Ruimin Hu, Zhenfeng Shao, Zhiqiang Hou: Parameter estimation in sparse representation based face hallucination. Digital Signal Processing 31: 28-34 (2014) (Wang Zhongyuan graduated from the lab in July 2010 with a Master of Engineering degree and, in September 2012, entered the School of Information at Renmin University of China as a part-time Ph.D. student.)

Zhongyuan Wang, Zhen Han, Ruimin Hu, Junjun Jiang: Noise robust face hallucination employing Gaussian-Laplacian mixture model. Neurocomputing 133: 153-160 (2014)

Zhongyuan Wang, Jun Chang, Ruimin Hu, Rui Zhong: Generating algorithm for integer DST radixes in video coding. J. Visual Communication and Image Representation 25(8): 1918-1921 (2014)

Zhongyuan Wang, Ruimin Hu, Shizheng Wang, Junjun Jiang: Face Hallucination Via Weighted Adaptive Sparse Regularization. IEEE Trans. Circuits Syst. Video Techn. 24(5): 802-813 (2014)

Zhongyuan Wang, Haixun Wang, Zhirui Hu: Head, modifier, and constraint detection in short texts. ICDE 2014: 280-291

Munan Li, Junfeng Zhou, Xin Liang: Modeling and Description of Organization-Oriented Architecture. JSW 9(4): 867-872 (2014)

Junfeng Zhou, Zhifeng Bao, Wei Wang, Jinjia Zhao, Xiaofeng Meng: Efficient query processing for XML keyword queries based on the IDList index. VLDB J. 23(1): 25-50 (2014). (Zhou Junfeng graduated from the lab in July 2009 with a Doctor of Engineering degree.)

Highlights of Lab Activities in 2014

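Ph.D. student Zhang Xiaojian traveled to Pennsylvania to attend SIAM SDM2014

Ph.D. student Wang Lu visited Hong Kong Baptist University for study and exchange

M.Sc. student Zhao Kejun interned at Microsoft Research Asia

Professor Meng Xiaofeng gave invited talks at universities

M.Sc. student Zhang Yu traveled to the University of Warsaw, Poland, to attend WI2014

M.Sc. student Gan Yantao received a Google scholarship

Professor Meng Xiaofeng chaired a special forum at CNCC

Lab faculty and students attended the WAIM/SAP Summer School

Professor Meng Xiaofeng was invited to Hong Kong to attend the massive data symposium

Professor Meng Xiaofeng visited Huawei Noah's Ark Lab in Hong Kong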