Tutorial Sessions/Invited Talks
All tutorials and invited talks are free to registered
conference attendees of all conferences held at
CSCE'17. Those who are interested in attending one
or more of the tutorials are to sign up on site at the
conference registration desk in Las Vegas. A complete &
current list of
CSCE
Tutorials
can be found later on the
congress website.
In addition to tutorials at other conferences,
DMIN'17 aims at providing a set of tutorials dedicated
to Data Mining topics. The 2007 key tutorial was given
by Prof. Eamonn Keogh on Time Series Clustering. The
2008 key tutorial was presented by Mikhail Golovnya
(Senior Scientist, Salford Systems, USA) on Advanced
Data Mining Methodologies. DMIN'09 provided four
tutorials presented by Prof. Nitesh V. Chawla on Data
Mining with Sensitivity to Rare Events and Class
Imbalance, Prof. Asim Roy on Autonomous Machine Learning,
Dan Steinberg (CEO of Salford Systems) on Advanced Data
Mining Methodologies, and Peter Geczy on Emerging
Human-Web Interaction Research. DMIN'10 hosted a
tutorial presented by Prof. Vladimir Cherkassky on
Advanced Methodologies for Learning with Sparse Data. He
was a keynote speaker as well (Predictive Data Modeling
and the Nature of Scientific Discovery). In 2011, Gary
M. Weiss (Fordham University, USA) presented a tutorial
on Smart Phone-Based Sensor Data Mining. Michael Mahoney
(Stanford University, USA) gave a tutorial on Geometric
Tools for Identifying Structure in Large Social and
Information Networks. DMIN'12 hosted a talk given by
Sofus A. Macskassy
(Univ. of Southern California, USA) on Mining
Social Media: The Importance of Combining Network and
Content as well as a talk given by Haym Hirsh (Rutgers
University, USA): Getting the Most Bang for Your Buck:
The Efficient Use of Crowdsourced Labor for Data
Annotation. Professor Hirsh was a
congress keynote
speaker, too.
In addition, we hosted tutorials
and invited talks held by Peter Geczy on Web Mining,
Data Mining and Privacy: Water and Fire?,
and Data Mining in Organizations. DMIN'13 hosted the
following tutorials:
EXTENSIONS
and APPLICATIONS of UNIVERSUM LEARNING
presented by
Vladimir
Cherkassky (Dept. Electrical & Computer Eng.,
University of Minnesota,
Minneapolis, USA),
Visualization
& Data Mining for High Dimensional Datasets
presented by Alfred
Inselberg, (School of Mathematical Sciences, Tel
Aviv University, Tel Aviv,
Israel)
as well as invited talks: Big Data = Big Challenges?
given by Peter Geczy (National Institute of Advanced Industrial
Science and Technology (AIST), Japan)
and The Problem
of Induction: When Karl Popper meets Big Data
given by Vladimir Cherkassky.
DMIN' 17 will be hosting the following
tutorials/invited talks:
Invited Talks
Invited Talk A |
Speaker |
Dr. Peter Geczy
National Institute of Advanced Industrial
Science and Technology (AIST), Japan |
|
Topic/Title |
Data Economy: The New Gold Rush? |
Date & Time |
Tuesday, July 18, 09:20 - 10:20am |
Location |
Copper Room |
Description |
If data is the ‘new gold’, is
data mining the new gold rush? Companies and
governments are racing to accumulate and exploit
as much data as possible. Data about customers,
operations, transactions, interactions – and the
list continues. Data has been a significant
innovation driver for several segments of
developed economies. It is among the most prized
assets of not only data-driven technology
companies but also governments and individuals.
Data has a substantial inherent value.
Realization of this value drives the rapidly
expanding data-oriented technology sector. Data
mining technologies have been playing a central
role in an increasing spectrum of economic
activities. Growing data economy has been
manifesting along several dimensions. We shall
explore the pertinent dimensions of data economy
and trends at the intersections of academic and
commercial interests in data-oriented
technologies.
|
Short Bio |
Dr. Peter Geczy holds a senior position at the
National Institute of Advanced Industrial
Science and Technology (AIST). His recent
research interests are in information technology
intelligence. This multidisciplinary research
encompasses development and exploration of
future and cutting-edge information
technologies. It also examines their impacts on
societies, organizations and individuals. Such
interdisciplinary scientific interests have led
him across domains of technology management and
innovation, data science, service science,
knowledge management, business intelligence,
computational intelligence, and social
intelligence. Dr. Geczy received several awards
in recognition of his accomplishments. He has
been serving on various professional boards and
committees, and has been a distinguished speaker
in academia and industry. He is a senior member
of IEEE and has been an active member of INFORMS
and INNS. |
Invited Talk B |
Speaker |
Diego Galar,
Division of Operation and Maintenance
Engineering, Luleå University of Technology,
971 87 Lulea, Sweden,
diego.galar@ltu.se |
|
Topic/Title |
Industrial data science
and black swans
|
Date & Time |
Tuesday, July 18, 10:40am - 12:20pm |
Location |
Copper Room |
Description |
Industrial systems are complex with respect to technology and operations
with involvement in a wide range of human
actors, organizations and technical solutions.
For the operations and control of such complex
environments, a viable solution is to apply
intelligent computerized systems, such as
computerized control systems, or advanced
monitoring and diagnostic systems. Moreover,
assets cannot compromise the safety of the users
by applying operation and maintenance
activities. Industry 4.0 is a term that
describes the fourth generation of industrial
activity which is enabled by smart systems and
Internet-based solutions. Two of the
characteristic features of Industry 4.0 are
computerization by utilizing cyber-physical
systems and intelligent factories that are based
on the concept of "internet of things".
Maintenance is one of the application areas,
referred to as maintenance 4.0, in form of
self-learning and smart systems that predicts
failure, makes diagnosis and triggers
maintenance by making use of “internet of
things”.
Thus, for complex assets, much information needs to be captured and mined to
assess the overall condition of the whole
system. Therefore the integration of asset
information is required to get an accurate
health assessment of the whole system, and
determine the probability of a shutdown or
slowdown. Moreover, the data collected are not
only huge but often dispersed across independent
systems that are difficult to access, fuse and
mine due to disparate nature and granularity. If
the data from these independent systems are
combined into a common correlated data source,
this new set of information could add value to
the individual data sources by the means of data
mining.
However the data collected are not sufficient due to the black swan effect
which pop up by the means of rare events not
considered by the data driven models. The black
swan events is a metaphor that describes an
event that comes as a surprise, has a major
effect, and is often inappropriately
rationalized after the fact with the benefit of
hindsight. The term is based on an ancient
saying which presumed black swans did not exist,
but the saying was rewritten after black swans
were discovered in the wild.
This
talk will discuss the possibilities that lie
within applying the maintenance 4.0 concept
in the industry and the positive effects on
technology, organization and operations from
a systems perspective and its limitations if
black swans are neglected.
|
Short Bio |
Prof. Diego Galar
holds a M.Sc. in Telecommunications and a
PhD degree in Design and Manufacturing from
the University of Saragossa. He has been
Professor in several universities, including
the University of Saragossa or the European
University of Madrid, researcher in the
Department of Design and Manufacturing
Engineering in the University of Saragossa,
researcher also in I3A, Institute for
engineering research in Aragon, director of
academic innovation and subsequently
pro-vice-chancellor.
He has authored
more than two hundred journal and conference
papers, books and technical reports in the
field of maintenance, working also as member
of editorial boards, scientific committees
and chairing international journals and
conferences.
In industry, he
has been technological director and CBM
manager of international companies, and
actively participated in national and
international committees for standardization
and R&D in the topics of reliability and
maintenance.
Currently, he is
Professor of Reliability and Maintenance in
Skovde University, holding the VOLVO chair
for maintenance, and Professor of Condition
Monitoring in the Division of Operation and
Maintenance Engineering at LTU, Luleå
University of Technology, where he is
coordinating several EU-FP7 projects related
to different maintenance aspects, and was
also involved in the SKF UTC center located
in Lulea focused in SMART bearings. He is
also actively involved in national projects
with the Swedish industry and also funded by
Swedish national agencies like Vinnova.
In the
international arena, he has been visiting
Professor in the Polytechnic of Braganza
(Portugal), University of Valencia and NIU
(USA), currently, University of Sunderland
(UK) and University of Maryland (USA). He is
also guest professor in the Pontificia
Universidad Católica de Chile. |
Tutorials
Tutorial A |
Speaker |
Gary M. Weiss
Interim Chair, Associate Professor &
Director of Wireless Sensor Data Mining
(WISDM) Lab, Dept. of Computer and
Information Science, Fordham University,
Bronx NY, USA
Andrew H. Johnston
WISDM Lab, Fordham
University, Bronx NY, USA
|
Topic/Title |
Terrorists, Hackers, and Criminals:
Understanding the Darknet
|
Date & Time |
Tuesday, July 18, 04:20 - 06:00pm |
Location |
Copper Room |
Description |
The Darknet is the area
of the internet that terrorists, child
pornographers, hackers, and insider traders call
home. Utilizing technologies like TOR, I2P, and
ZeroNet, the darknet’s anonymity and distributed
nature make law enforcement operations all but
impossible. Likewise, the covert and secretive
nature of most sites means that most sites are
known only to their users. For those not a part
of the underworld, the darknet represents an
interesting research opportunity. Otherwise
hard-to-find underworld groups roam freely, and
there are many opportunities to generate
interesting data sets.
In this tutorial, we
introduce the audience to the different
technologies and explore what type of security
features are used by the different underworld
groups. We will also briefly cover different
attacks and potential attacks that could be used
to break the security features provided by the
different darknet networks. We will also cover
some new research that attempts to recognize
terrorist content from benign content, and its
use as a tool for finding terrorist content on
both the darknet and the regular internet. We
will discuss how similar models can be made
using text, image, and graph mining techniques.
This tutorial is suitable
for a general audience, but is especially
recommended for data scientists, researchers,
and cybersecurity practitioners who have an
interest in cultivating data from criminal and
terrorist enterprises, exploiting anonymity
networks, or data mining on such networks.
|
Short Bio |
Gary Weiss is an associate professor,
and interim department chair, in the
Department of Computer and Information
Science at Fordham University in New
York City. He is the director of the
Wireless Sensor Data Mining (WISDM) Lab,
which explores how smartphones,
smartwatches, and other mobile sensors
can be used to support human activity
recognition, biometrics, and other
sensor-based applications. More recently
he has started research on the Darknet
and the use of text mining to identify
terrorist sentiment. His work is funded
by the US National Science Foundation,
Google, and several other industry
partners. He has published over fifty
papers in machine learning and data
mining.
Andrew Johnston is with
Fordham University's Department of
Computer Science and is a member of the
Wireless Sensor Data Mining (WISDM) Lab.
Andrew specializes in using data mining
and artificial intelligence techniques
to explore and improve security systems.
His recent research has focused on using
data mining techniques to create the
first gait-based biometric system using
a smartwatch. He is a coauthor of
“Mobile Biometrics”, the first textbook
on the topic, scheduled for release in
September 2017. Andrew has worked with
City of Hope Hospitals, LaQuinta Hotels,
Staples, and with the FBI employing a
data-driven approach to improving
cybersecurity.
|
Slides |
Darknet Tutorial
Slides |
Tutorial B |
Speaker |
Ulf
Johansson, Department of Computer Science
and Informatics, Jönköping University,
Sweden,
ulf.johansson@ju.se
|
Topic/Title |
Conformal Prediction - Predicting with
Confidence
|
Date & Time |
Monday, July 17, 03:40 - 05:40pm |
Location |
Copper Room |
Description |
How good is your
prediction? In risk-sensitive applications, it
is crucial to be able to assess the quality of a
prediction, but traditional classification and
regression models don't provide their users with
any information regarding prediction
trustworthiness.
Conformal predictors,
on the other hand, are predictive models that
associate each of their predictions with a
precise measure of confidence. Given a
user-defined significance level E, a conformal
predictor outputs, for each test pattern, a
multivalued prediction region (class label set
or real-valued interval) that, under relatively
weak assumptions, contains the test pattern’s
true output value with probability 1-E. In other
words, given a significance level E, a conformal
predictor makes an erroneous prediction with
probability E. The conformal prediction
framework allows any traditional classification
or regression model to be transformed into a
confidence predictor with little extra work,
both in terms of implementation and
computational complexity.
Some key properties
of conformal prediction are:
• We obtain
probabilities/error bounds per instance
• Probabilities are
well-calibrated: 95% means 95%
• We don't need to
know the priors
• We make a single
assumption - that the data is exchangeable ~
i.i.d.
• We can apply it to
any machine learning algorithm
• It is rigorously
proven and straightforward to implement
• There is no magic
involved – only mathematics and algorithms
Hence, confidence
predictors is an important tool that every data
scientist should carry in their toolboxes, and
conformal prediction represents a
straight-forward way of associating the
predictions of any predictive machine learning
algorithm with confidence measures.
This tutorial
aims to provide an introduction and an
example-oriented exposition of the conformal
prediction framework, directed at machine
learning researchers and professionals. A
publicly available Python library, developed by
one of the authors of the tutorial, will be used
for the running examples.
The goal of
the tutorial is to provide attendees with the
knowledge necessary for implementing functional
conformal predictors, and to highlight current
research on the subject.
|
Short Bio |
Prof. Ulf Johansson holds a M.Sc. in
Computer Engineering and Computer
Science from Chalmers University of
Technology, and a PhD degree in Computer
Science from the Institute of
Technology, Linköping University,
Sweden. Ulf Johansson’s research
focuses on developing machine learning
algorithms for data analytics. Most of
the research is applied, and often
co-produced with industry. Application
areas include drug discovery, health
science, marketing, high-frequency
trading, game AI, sales forecasting and
gambling. In 2011, he had his 15 minutes
of fame when called as an expert witness
in the Swedish Supreme Court regarding
whether Poker is a game of skill or
chance. In the court, Prof. Johansson
argued that skill predominates over
chance using, among other sources, his
paper “Fish or Shark – Data Mining
Online Poker”, originally presented at
DMIN 2009. Ulf Johansson has
published extensively in the fields of
artificial intelligence, machine
learning, soft computing and data
mining. He is also a regular program
committee member of the leading
conferences in computational
intelligence and machine learning.
During the last few years, Prof.
Johansson has published several papers
on conformal prediction, some presented
in top-tier venues like the Machine
Learning journal and the ICDM
conference.
|
Slides |
Conformal Prediction Slides |
Tutorial C |
Speaker |
Diego Galar,
Division of Operation and Maintenance
Engineering, Luleå University of Technology,
971 87 Lulea, Sweden,
diego.galar@ltu.se
|
Topic/Title |
eMaintenance and Industry 4.0: Knowledge
extraction in Industry and Transportation
|
Date & Time |
Tuesday, July 18, 06:00-07:00pm |
Location |
Copper Room |
Description |
Information is key to
successful and cost effective management of
maintenance process. Therefore the integration
of Information and Communication
Technologies(ICT) into the maintenance process
is essential to achieve excellence in
maintenance performance. e-Maintenance solutions
provide platform for seamless integration of ICT
with maintenance. In short, development and
implementation of eMaintenance solutions can be
considered as a fusion of various technologies
and methodologies.
The general opinion
among the asset managers is that the application
of information technology brings dramatic
results in machine reliability and O&M
(operations and maintenance) process efficiency,
however only few operations and maintenance
managers can show or calculate the benefits of
such applications. Technology providers are
trying to develop more and more advanced tools
while the maintenance departments seem to
struggle with the daily problems of
implementing, integrating and operating such
systems. The users combine their experience and
heuristics in defining maintenance policies and
in the usage of condition monitoring systems.
The resulting maintenance systems seem to be a
heterogeneous combination of methods and systems
in which the integrating factor between the
information and business processes is the
personnel. The information of the assets goes
through these human minds forming an
organizational information system and creating a
high reliance on the expertise of the as-set
management staff.
Indeed, the contribution
of operation and maintenance data to the
different asset management stages has been
triggered by the emergence of intelligent
sensors for measuring and monitoring the health
state of a component, the gradual implementation
of Information and Communication Technologies
(ICT), and the conceptualization and
implementation of e-maintenance.
During the last decade,
the global competition and advancement of ICT
have forced Production and Process Industries
through a continuous transformation and
improvement process. The business scenario is
focusing more on e-business intelligence to
perform transactions with a focus on customers’
needs for enhanced value and improvement in
asset management. Such business requirements
compel the organizations to minimize the
production and service downtime by reducing the
machine performance degradation. The above
organizational requirements stated in ISO 55000
necessitate the development of proactive
maintenance strategies to provide optimized and
continuous process performance with minimized
system breakdowns and maintenance. With these
changing systems of the busi-ness world in the
21st century, a new era of data e-services
coming from asset data has emerged.
The goal of this
tutorial is to provide a timely review of
research efforts on the topic of e-Maintenance
covering both theoretical and applied research
that will contribute towards understanding the
strategic role of eMaintenance solutions in the
business and create a volume of recent work on
this subject. We hope that this tutorial will
stimulate new areas of thinking in management of
complex maintenance processes which are
increasing the relevance in the organizations.
Indeed organizations are
under pressure to continually enhance their
ability to create value for their customers and
improve the cost effectiveness of their
operations. In this regard, the maintenance of
large-investment assets, once thought to be a
necessary evil, is now considered a key function
for improving the cost effectiveness of
operations and creating additional value by
delivering better and more innovative services
to customers.
With the change in
strategic thinking of organizations, the
increasing amount of outsourcing and the
separation of OEMs and asset owners, it is
crucial to measure, control and improve the
asset maintenance performance. Today, with the
advances in technology, various maintenance
strategies have evolved, including condition
based maintenance, predictive maintenance,
remote-maintenance, preventive maintenance,
e-maintenance etc. A main challenge faced by
most organizations is choosing the most
efficient and effective strategies to enhance
and continually improve operational
capabilities, reduce maintenance costs and
achieve competitiveness in the industry.
|
Short Bio |
Prof. Diego Galar
holds a M.Sc. in Telecommunications and a
PhD degree in Design and Manufacturing from
the University of Saragossa. He has been
Professor in several universities, including
the University of Saragossa or the European
University of Madrid, researcher in the
Department of Design and Manufacturing
Engineering in the University of Saragossa,
researcher also in I3A, Institute for
engineering research in Aragon, director of
academic innovation and subsequently
pro-vice-chancellor.
He has authored
more than two hundred journal and conference
papers, books and technical reports in the
field of maintenance, working also as member
of editorial boards, scientific committees
and chairing international journals and
conferences.
In industry, he has been
technological director and CBM manager
of international companies, and actively
participated in national and
international committees for
standardization and R&D in the topics of
reliability and maintenance.
Currently, he is
Professor of Reliability and Maintenance in
Skovde University, holding the VOLVO chair
for maintenance, and Professor of Condition
Monitoring in the Division of Operation and
Maintenance Engineering at LTU, Luleå
University of Technology, where he is
coordinating several EU-FP7 projects related
to different maintenance aspects, and was
also involved in the SKF UTC center located
in Lulea focused in SMART bearings. He is
also actively involved in national projects
with the Swedish industry and also funded by
Swedish national agencies like Vinnova.
In the
international arena, he has been visiting
Professor in the Polytechnic of Braganza
(Portugal), University of Valencia and NIU
(USA), currently, University of Sunderland
(UK) and University of Maryland (USA). He is
also guest professor in the Pontificia
Universidad Católica de Chile.
|
|
|