Big Data in the Natural Sciences and Humanities - Max

Transcription

Big Data in the Natural Sciences and Humanities - Max
The ERTC is funded by the CAS and the MPG
Chair MPG – Hans Wolfgang Spiess
Director (em.) Max Planck Institute for Polymer Research
Steering Committee CAS
Chair: Huadong Guo, Member of CAS,
Institute of Remote Sensing and Digital Earth, CAS
Runsheng Chen, Member of CAS, Institute of Biophysics, CAS
Jianjun Sun, President, School of Information Management, Nanjing University
Zhiwei Xu, Institute of Computing Technology, CAS
Steering Committee MPG
Chair: Thomas Lengauer,
Managing Director of the Max Planck Institute for Informatics
Claudia Draxl, Physics Department and IRIS Adlershof, Humboldt Universität zu Berlin;
Max Planck Fellow at the Fritz Haber Institute of the Max-Planck-Gesellschaft
Klaus-Robert Müller, Head of Group Machine Learning, Technical University of Berlin
Jürgen Renn, Director at the Max Planck Institute for the History of Science
Matthias Scheffler, Director at the Fritz Haber Institute of the Max-Planck-Gesellschaft
Martin Vingron, Managing Director of the Max Planck Institute for Molecular Genetics,
Director at the CAS-MPG Partner Institute for Computational Biology
S
CE
SE
NE
Co-Chair CAS – Xu Zhang
Vice President of Shanghai Branch, CAS
Director of Shanghai Institute for Advanced Studies, CAS
Professor of Institute of Neuroscience, SIBS, CAS
C HI
Chair CAS – Wenqing Shen
Member of CAS
Former Vice President of NSFC
Chair of Shanghai Institute for Advanced Studies, CAS
Professor of Shanghai Institute for Applied Physics
ACA
I
EN
Organizers
DE M Y OF
SC
CHINESE ACADEMY OF SCIENCES
Max-Planck-Gesellschaft
Hofgartenstraße 8
80539 Munich, Germany
P.O. Box 10 10 62
800084 Munich, Germany
CHINESE ACADEMY
OF SCIENCES
52, Sanlihe Road
Beijing 100864, China
www.cas.cn
www.mpg.de/en
VENUE
6th Exploratory Round Table Conference
Shanghai Institute for Advanced Studies
Chinese Academy of Sciences
Building 1, 319 Yue Yang Rd.
Shanghai 200031, China
Big Data in the
Natural Sciences and Humanities
Shanghai, November 19 th to 21 st, 2015
Fellows MPG
Marcel Schulz, Cluster of Excellence Multimodal Computing and Interaction,
Saarland University / Max Planck Institute for Informatics
Luca Ghiringhelli, Fritz Haber Institute of the Max-Planck-Gesellschaft
Grégoire Montavon, Technical University of Berlin
Florian Schmaltz, Max Planck Institute for the History of Science
Robert Schöpflin, Max Planck Institute for Molecular Genetics
CAS Headquarters Beijing
Tieniu Tan, Bureau of International Co-operation, CAS
Jiaofeng Pan, Bureau of Development and Planning, CAS
Feng Zhang, Bureau of Development and Planning, CAS
Fang Jiang, Bureau of Development and Planning, CAS
Wang Dongyao, Bureau of International Co-operation, CAS
CAS Shanghai
Wenjun Zhang, Shanghai Institute for Advanced Studies (SIAS)
Xiaolong Teng, Shanghai Institute for Advanced Studies (SIAS)
Fang Xue, Shanghai Institute for Advanced Studies (SIAS)
MPG Administrative Headquarters Munich
Christoph Ettl, Presidential Division, Scientific Coordination
Christiane Walch-Solimena, Presidential Division, Scientific Coordination
Sabine Panglung, Division for International Relations
C O N TA C T
Ms Fang Xue
(Conference Office) Ms Sabine Panglung
Dr. Christoph Ettl Dr. Christiane Walch-Solimena
[email protected]
Phone: +86 21 6433 7978
Fax: +86 21 6433 7927
[email protected]
[email protected]
[email protected]
Cover image: Tag cloud of Big Data for the sciences © WordItOut.com
Fellows CAS
Lizhe Wang, Institute of Remote Sensing and Digital Earth, CAS
Jianjun Luo, Institute of Biophysics, CAS
Huawei Shen, Institute of Computing Technology, CAS
Dongxiao Gu, School of Information Management, Hefei University of Technology
Dong Liang, Institute of Remote Sensing and Digital Earth, CAS
ERTC on Big Data in the Natural Sciences and Humanities
I n v i t e d S p e a k e r s a n d P ar t icip a n t s
Big Data has become a ubiquitous notion in recent years. Technological developments,
in particular in informatics and high-throughput approaches, have revolutionized data
generation in all fields of science. As a consequence, researchers in almost all areas
of science face new and unforeseen challenges: Gathering data is so easy and quick
Alessandro DE VITA (King‘s College London, UK)
that it exceeds by far the capacity to validate, analyze, visualize, store, and curate
all the information. Tackling this challenge will without doubt lead to unprecedented
data-driven scientific discoveries.
SIAS House
E x p lo r a t o r y R o u n d Ta ble C o n f e r e n c e s
o f t h e C hin e s e A c a d e m y o f S cie n c e s
a n d t h e M a x- P la n c k- Ge s ells c h a f t
Exploratory Round Table Conferences or ERTC are a joint activity
of the Chinese Academy of the Sciences (CAS) and the Max-PlanckGesellschaft (MPG) under the auspices of the Shanghai Institute of
Advanced Studies (SIAS).
ERTC are intended to provide a platform for scientists of both MPG and
CAS to exchange ideas and reflect on opportunities of newly emerging
research areas together with the respective international key players at
an early stage of these evolving fields. The main objective of the project
is to act as a seed towards establishing new topical areas as part of a
priority-setting process at the leading edge of science in the supporting
organisations CAS and MPG. Moreover, the reports of the ERTC will be
widely communicated to both science policy makers as well as to the
general scientific community. ERTC are to be held at Shanghai in the
premises of SIAS at least once a year.
MPG and CAS have maintained an exclusive partnership for over 30
years. The SIAS is an Institute of the CAS Shanghai Branch and was
founded in 2001 with the support of the MPG as a hub for interdisciplinary and international dialogue.
Each ERTC has a total duration of 3-6 months for preparation and followup. The topic of the 6th meeting will be Big Data in the Natural Sciences
and Humanities, including the following subthemes:
In the field of biomedicine, the dramatic advances in technologies that can be summed
up as omics, such as high throughput DNA-sequencing, lead to vast amounts of data
at dramatically plummeting costs. This revolution in biomedical research raises high
expectations as to the increase of knowledge, understanding of health and disease,
and eventually the development of powerful therapies to treat thus far uncurable diseases such as cancer or depression, in a personalized and precise fashion.
Such data-driven methods are revolutionizing not only drug design and drug discovery.
Regarding chemistry and materials science, Big Data techniques in combination with
computational modeling facilitate analyzing the vast space of yet unexplored compounds and materials - thus complementing and in several cases even replacing experiments. This high-throughput screening needs to be combined with novel big-data
analytics tools, which then enables the identification of new scientific phenomena,
advances materials science and engineering, and predicts materials with technologically relevant properties and functions.
Last but not least, the analysis of large volumes of data opens up new avenues of
research in the field of the humanities and social sciences. The wealth of data which
is already born digital as well as mass digitizing existing analog data allow to answer
complex questions that were previously unanswerable. Analyzing the development
and diffusion of knowledge, modeling cultural evolution, and predicting human behavior are just some of the challenges that lie ahead.
Big Data requires innovative technologies to efficiently process large quantities of
data within tolerable elapsed times. Machine learning is one of today’s most rapidly
growing technical fields lying at the core of data science and artificial intelligence,
indispensable for analyzing and classifying data. This development also provides
challenges for theory-building. Whereas data mostly exhibit correlations and statistical dependencies, theory provides causal relationships. The interplay between data
mining and theory building is an important issue, as Big Data continues to pervade
scientific and private life. Furthermore, data capture both in the health segment and in
daily life can pose a severe threat to privacy, as individuals are increasingly divulging
data relating to individual behavior and performance.
Michael BACKES (Saarland University, Germany)
BI Jun (Nanjing University, China)
CHENG Xueqi (Institute of Computing Technology, CAS, China)
Linda DIVARCI (Max Planck Institute for the History of Science, Germany)
Roland EILS (University Heidelberg, Germany)
Gerd GRASSHOFF (Humboldt University of Berlin, Germany)
GUO Huadong (Institute of Remote Sensing and Digital Earth, CAS, China)
Moritz HELMSTAEDTER (Max Planck Institute for Brain Research, Germany)
HUANG Cui (Tsinghua University, China)
JIANG Hualiang (Shanghai Institute of Materia Medica, CAS, China)
Manfred LAUBICHLER (Arizona State University, USA)
Thomas LENGAUER (Max Planck Institute for Informatics, Germany)
LI Hong (Institute of Physics, CAS, China)
LI Jiang (Zhejiang University, China)
LI Yixue (Shanghai Center for Bioinformation Technology, SIBS, CAS, China)
LUO Jianjun (Institute of Biophysics, CAS, China)
Klaus Robert MÜLLER (Technical University of Berlin, Germany)
Nico PFEIFER (Max Planck Institute for Informatics, Germany)
Matthias SCHEFFLER (Fritz Haber Institute of the Max-Planck-Gesellschaft, Germany)
Matthias SCHEMMEL (Max Planck Institute for the History of Science, Germany)
WANG Jun (Beijing Genomics Institute, China)
WANG Lizhe (Institute of Remote Sensing and Digital Earth, CAS, China)
XU Zhiwei (Institute of Computing Technology, CAS, China)
ZHANG Baichun, (Institute for the History of Natural Sciences, CAS, China)
T A TNAUTNER
H ESCHE
B PAWLEK
H C SPATZ
HCH SPATZ
M BEHNCKE
G BEHRENS
A X CZANDERNA
P I ROTTI
H HENNING
P PIERRLICII
P H T A TRAUTNRR
K H NIERHAUS
H G SCXIWEIGER
O FRANK
SCHERZINGEII
P TIIAMMANA
G STÖRPLER
WFIG
SCHMID
SCHWEIGER
H GHSCHWEIGER
K KLOPPSTECII
G SCIIWEIGER
R MASCHLER
L P TRAUB
I I BAUMHACKER
K ABEL
S BERGERM SCHWEIGER E P
H G WITTMANN
G BELLEMARE G HONMANN
P E HERRLICH
E SCIIERZINGER
H G SCNWMONN
SCHERZINGER
I E DEUSSER
C G KURLAND
H A RAIIMSDORF
R A GARRITJPI
E FILL
X PNNCM
W PRECIIT
BI G
BERGER
J I HERRLICII
II
Y OKADA
SCHWEIGER
R LLINIS
M SCNWUXCUU I ZEICIIIIARDT
F I SCNNÖRMX
H P MORSCH
C U BODE P N GRAY
H FBRINKSCHULTE
OESCHLER
G GRIGORIU
H F I HOMEYER
R MONIER R K BAUER
R HASSLßN
A MILLAR
E KUECHLER
R A GARREII
CFA BRYCE
R LIPPERHEIDE
F HAJDU
I CZERNILOFSKY R R CRICHTON C F BIIYCE
RLC GUMMING
L BAUM
G STÖFLLER
I FUCHS
R EASON
A FEUERSTEIN
R SANTO
P I OESCHLER
A BARCLAY
F GNOS
G
HAUL
C R HASSLER
F I KRÄUTLE
F I LUDECICE
F I SCHRÖTER
ED
KOHNE
C SCHULTE
LAZAR
G KLOTZ
N NANNINGA
I HERZER
S KALBITZER
L
G R NANNINGA
J P PONPON
A GARRETT
DAYAGROSJEAN
P SIFFERT
H G WRRRMANN
MA
NITSCII
H HERZER
E WAGNER
K N REINWALD
E REINWALD
R STOCK
WRACOMFORT
LESTER
L YNTEMA
W FEUERLEIN
M R WAHL
W PNUCM
H G BOHLEN
V OERTZEN
R BOCK
H HUGUTBROD
C
SCIILOTTHAUERVOOS
I I BOHLEN
W V OEIUZEN
B KOHLMEYER
W B CUOI
GN
MARQUARDT
M FEIL
A GAMP
Big Data in Biomedicine
Big Data in Physics, Chemistry and Earth Science
Big Data in the Humanities and Social Sciences
Technology Underlying Big Data
Point cloud of the only modern building survey of the Pantheon in Rome
© MPI for the History of Science
F CRMZUNN
F CXMMZU
I FIEIDER
W A MAELICKE
G ENGEL
F CRAMER
F V D HAAR
H SCIIEIT
Ü SCHNEIDER
D H GAUSS
F I KÖSTER
E PIOLZBERG
F WALTHER
I I PACHER
P W JAVEL
J G WEONOWE
B R BALDS
I I RENNER
D R BLACK
A M FRISCHAUF
G GREBE
D P BOBON
D J MATTKE
I V IOESCH O SIMONOVA
H H SCHULZ
H U WIDDEL
H R BREHME
S GOODY
BAGSHAWCR
W G TIRSCH
W PUONSX
K SATO
G RUMRICH
S KLÖSS
I I WUNDERLICH
F I SCHENK
P SCHMIEDEK
J SIMON
K M PIRKE
F I WISSER
J K MERTIN
JGBARRINGTON
SCHÄFER
LEIGH
EM
E WILDE
MANDELKOW
R WOLLMANN
J HOEKSTRA
G WENDLER
I I SCHULZE
D BINGMANN
UD
RUPPIEIII
W
V TOELLE
WSCHOENIIERR
SCHULZE
I WALTER
K A BECKER
BK
ZIMMERMANN
K BULLRICH
G G GRABITZ
D A MOOTZ
P KELLER
R ATTIG
A RABENAU
R KNIEP
H E HÖRSTER
V MOENNIG
L PISTER
F DEIN IIARDT
R TIMPL
W BAUHOFER
P I BACKMUND
M T TACKE I I ULLRICH
K KÜHN
F I BAUER
K H RIEDERL GENZEL
E K HRSG
I ACXUN
LENIGERFOLLERI
R HUCNA ITUCN
R E WODICK
L TECKNAUS
F I KELLER W JELLINGIIAUS
D X LÜBBRRS
H E WEIGELT
H BAUMGÄRTL
P D SCHÄKER
S SCHNEIDER
I I KELLER
A KRISCH
I L KUDIELKA
C H WEIGELT
U GRAMBERG
W GRÜNEWALD
A HUCII
D HORSTMANN
M BOLDT
S SCHUCIIEIARDI
STEIGERGJ
W LÜBBERS C RÜEGG HRSG
E K FQLLEM
KEASTOSSECK
KHUCH
LENXOERFOLLERT
K MÖL LING
R KURTII
H D BAUER
T GRAF
F I GELDERBLOM
I KRÖN
M D KESSLER
W M KESSLER
J HÖPER
I LANGER
I I LANG
WM
WESSEL
THERMAN
K BRAND H M LANG
L E GRÖRNANDT
E L SINAGOWITZ
L
K L MESSMER
FSUNDERPLASSMANN
JESCH I I RAHMER
F JESEK
M BLÄTIE
P MUTHIAII
R FURTIMAYRH
K KÜIIN
B RENUCCI
I T KLOSE
W F BEIL
E SCHNEIDER
D STORZER
BI IPLECHTIG
R N BAKER
B P GLASS
R GIJBELS
D D HEYMANN
T P KIRSTEN
P FIORN
G POUPEAU
O A SCHAEFFER
W GENTNER
I KANEOKA
G HEUSSER
M JIXOVA
S K THIOJ KIKO
J BOÖEK
Z CEPLECHA
W KRATSCITMER
I I GELDERBLOM
T PLIENINGER
O MEDENBACH
O MÜLLER
M P PAVICEVIC
P RAMDOHR
P BOLOGNESI
L C CLARK
I A SILVER
D D SCHÄFER
I I STARLINGIÖR D F BRULEY
M ILLIERIMIANN
P FIETZEK
F W REXRODT
P P WENDT
M STARK
K MOLLINO
G HÜPER
E I ACKER
W PIUSSMANN
D SCHARIZIT
U BECKER
M BOTHE
T Z STEIN
W D STREUBEL
I T KARGL
GMGA
WAGNERD DEHNIIARD
H OHNUMA
H MACKHH
G MAIRLE
W W DAEHNICK
T S BIIATIA
F I FURTHMAYR
C M LAPIERE
G WICK
M F STOLTZ
F I FURTIIMAYR
E FRÖMTER
B GEBLER
MWIGAND
I KNAUFJ A YOUNO
I I FIENNEMANN
W PRETTL
T I MARTIN
C R BECKER
A W IIEIDLAND
M ISHIGAME
FS ICARPENTER
FAUL
M REIMER
R GOLLWITZER
F KETTEL
R KOCZOREK
A MIRSUXSXXL
Y HOMMA
G N KLIPPING
I HACKER
J H G MANNIIERZ
GJ
R H SCIIIRMER
H G MANNIIERZ
BARRINGTONLEIGH
D R TRENTHAM I GEMANNHERZ
G ROSE
SCHLIMME
J H G MANNHERZ
R GOONY C HOLMES
C R BAGSHAW
D FWECCLESTON
YATES
H J JAECKEL
T H FREDKRKING
E ELELMREICH
H RINGLER
H RENNER
G PACIIER
F ECXSTEIN
I I GIEDKE
I L WISSER
ROOR
JGGERNHARDT
R ALLGEYER
O KLÜBER
W JAKOBUS
F KERL
J E GRUBER
R POHLCIIEN
J BÄUMLER
G IIERPPICH
E BREIT
KNOBLOCH
I AI LOHNERI
I I IIASSLEN
BROSWALD
PÖNL
CIIEN CIIEN
JHÄCLSPERGER
JUNKER
GRWULFF
JPI IKOLOS
RPÖHL
CM
S REHKER
MEYER
JSTREIBL
KOLOS
GII WOLF
I IRWOBIG
CHEN
IPÖNL
KF GROSS
KUNZE
G
WOLF
GH
SCHILLING
GHWOLI
KOTTMAIR
GM
J GRIEGER
K
BLAUMOSER
A KNOBLOCII
R KAUFMANN
J G WEGROWE
FREUDENBERGER
K A ELSNER
JUULMAN
I ICOL
RINGLER
G DUESINGE WETTERER
JAENICKE
I I RING RLER
G PASCHER
I IIIIEGLII
EGWÜNSCIIING
R CANO
E WÜR SCIIINO
G LISITANO
S G CORTI
B ZANFAGNA GLISITANO
W MAURER
W HAAR
F I RÜTERJANS
R WEVER
D STAMM
M
FATRANSKA
K KATAOKA
C F K ZEILLER
E DECLERCQ
T C MERIGAN
M SCHIDLOWSKI
P I WINKLER
J GAST
This ERTC aims at elaborating a critical review of the presently existing ideas, strategies and aspirations of Big Data science. The results of the ERTC will serve as a basis
for further consideration by CAS and MPG regarding research in this field.
L DOLAN
I STERNBACH
J F HOBBS
A D X LÜBBERS
•
•
•
•
M SPRINZL
K GRIESE
W SEILER
SCHMIDT
H JURÜGER
F RADLER
M KEMPFLE
I I TACKE
G DIECKMANN
Y NAKAMURA
O R BALD
P WREDE
T WAGNER
J ZIMMERMANN A ERDMANN
R HORNE
K H SCHEIT
F CNARUEN
A RHUC
G BXUMLER
O R PONGS
H G DOBERER
W MCCLURE
C JUNGE
F I GIEDKE
H P WISSER
P DOERR
MJALTHERIB
S IIÖGLUND
L LJUNG
ANATARAMAN
J IASCIIIIFER
I R CCOMFORI
E RIDINGER
J UMBARCIER
K A EBERHARD
H S PLENDL
U STROH BUSCH
N GRAMA
D BURCH
M HAMM
KO
L
GROENEVELD
W PAPAJEWSKI
MEYERSCHÜTZMEISTER
P
A RICHTER
M A LEE
C K GELBKË
BRAUNMUNZINGER
I T FORTUNE
W WANN
D S FADER
R H SIEMSSEN
KD
W TIPPIE
D FICK
W WMSS
FIILDENBRAND
P C SCHWINDT
F I FRANK
M R BLOCH
A ELGORESY
L A TAYLOR
K ZECHEL
I GORESY
W SCIIRANKEL
A L IDSCIIOK
BD LÜERS
I GEYER
F I MUCK G W
SCIIWARZSCHILD
J W LAMSA
P FREUNDH FRANZ
BRANDENBURG
P KOBEI I FRANK
P J IFISCIIINGER
OPUNAN
M V NEKMUT
W SCHRANKEL
R W SCHÄFER
V BLOBEL
DESY
W J RICHTER
BONN JJMÜCK
J LANGE
M L SHAND
N SCHMITZ
H PRIESS
J G GAY
W ZILLIG
K BUCHNER
K ZECIIEL
G GÖBEL
G DEIIM
W GEIST
W WITTEK
M CARDONA
N SHEVCHIK
F CERDEIRA
M BETTINI
I C M PENCHINA
W DREYDRODI T A FJELDLYD X LANGER
J TEJEDA
F WILLMANN
T ISNIGURO
T ISMOUNO
C ELBAUM
J S LANNIN
N SHEVCIIIK
G B ITYAMSW KOCH
P WEILIIARIMER
WWKOCII
H BLUM
P WEILHAZMNIER
G LÜTJENS
B WINSTEIN
B HYAMS
G B PGRAYER
WEILHAMMER
P
E LORENZ
ICTJONES
DIETL
R TIRLER
I I LIPPMANN
W OUBURGER
L S PEAK
STIER
LIN
L S ROCHESTER
W MAENNER
J
SCIILEIN
C RMGRUHN U STIERLINJ MEISSBURGER
B HXRAMS
MEISS
D ZAHNISER
W
OCHS
W LANGBEIN
BOTTBODENHAUSEN
D O CALDWELL
C W FABJAN
F SAULI
ELORENZ
F J WAGNER
HDIETL
C LÜTJENS
Connectivity between scholars at various research institutes, based on
co-authorship in 1973 © MPI for the History of Science