参加はできませんでしたが、Making Data Work: Strata Conference + Hadoop World - O'Reilly Conferences, October 23 - 25, 2012, New York, NYの内容は多少なりとも追いかけたいと思ったので、 Keynote だけでも観るために、個人的な Todo リストとするためにビデオを集めてみました。
1 つ 1 つは 10-20 分前後なので、スキマ時間に iPad ででも少しづつ觀ていこうと思っています。
とは言え、 15 本、なかなかボリューミーではあるな、と。
Society confronts enormous challenges today: How will we feed nine billion people? How can we diagnose and treat diseases better, and more cheaply? How will we produce more energy, more cleanly, than ever before?
Big questions like these demand new approaches, and "Big Data" is a crucial of the toolkit we will use over the coming years to answer them. New algorithms, applied to much more raw data than has ever been available before, will help professionals in almost every discipline make better, more informed decisions, and will guide research and policy toward better outcomes, faster.
Born in the consumer internet, the Apache Hadoop platform has, over the last six years, become a critical piece of infrastructure for government, commercial and research organizations that need to answer big questions using Big Data. In his opening keynote, Mike will explore some revolutionary use cases form his own experiences at Cloudera and will show how building applications within a broader community and ecosystem has vast implications for the speed and depth of innovation, helping humanity to ask bigger questions and gain bigger answers.
Data science is a team sport. Collaboration inside and outside your organization is the ultimate Big Data technique. Success depends on having a collaboration platform and solving the number one problem of the Big Data era: the supply and demand for data scientists. Learn how you can take action today to accelerate the success of your data science efforts.
This keynote is sponsored by Greenplum, a division of EMC
The End of the Data Warehouse
Hadoop is scalable, inexpensive and can store near-infinite amounts of data. But driving it requires exotic skills and hours of batch processing to answer straightforward questions. Learn how everything is about to change.
Moneyball for New York City
New York City is a complex, thriving organism. Hear how data science has played a surprising and effective role in helping the city government provide services to over 8 million people, from preventing public safety catastrophes to improving New Yorkers' quality of life.
Much of the heavy lifting involved with Big Data projects is accessing and preparing the data for analysis or what is often referred to as data integration. This can easily consume up to 80% of a big data development effort and yet too many developers resort to reinventing the wheel by hand-coding custom connectors, data parsers, and data integration transformations. Why not leverage a metadata-driven, codeless IDE with pre-built transformations and data quality rules so that custom development time can better be spent where it's truly needed? Codeless environments have proven to be up to 10 times more productive than hand coding, are less error prone and easier to maintain. The skills are already out there and available from in-house IT or system integrators making it possible to get your projects running and into production quickly.
This keynote is sponsored by Informatica
The Composite Database
While moving away from single powerful servers, distributed databases still tend to be monolithic solutions. But e.g. key-value storage is rapidly becoming a commodity service, on which richer databases might be built. What are the implications?
Big Data Direct -- The Era of Self-driven Big Data Exploration
In recent years, "Big Data" has matured from a vague description of massive corporate data to a household term that refers to not just volume but the diversity of data and velocity of change. Today, there's a wealth of data trapped in corporate data repositories, new platforms like Hadoop, a new generation of data marketplaces and volumes generated hourly on the Web. With the opportunity for key insights that these diverse data sources present, the business user's ability to get to the data when they need it and gleam fast insights has become a massive priority. In a nutshell, easing access and analysis of both private and public data is one of the biggest opportunities ahead. New approaches to enable self-driven exploration of private and public data are necessary and will help address the critical 'last mile' problem in big data. Big Data Direct discusses the opportunity ahead for business users to intuitively and easily harness the power of private and public data for deeper customer intelligence and to identify new business opportunities.
Bringing the 'So What' to Big Data
The onset of the Big Data phenomenon has created a unique opportunity to improve the human condition, but the challenge ahead of us is to move beyond Big Data infrastructure to real, applied, and prioritized comprehension that is morally and practically useful. This requires redirecting our collective energies toward new algorithms, more distributed systems, and purer software architectures that more optimally exploit the infrastructure to answer questions of great social and personal value. Technologies that close the "Understanding Gap" can make great strides to prevent evil, reduce suffering, and create more actualized human potential. This pursuit is more than an opportunity- it is a key responsibility for the technology community today and through at least the next decade.
The Human Face of Big Data
Over the past two decades, Rick Smolan, creator of the best selling "Day in the Life" books, has produced a series of ambitious global projects in collaboration with hundreds of the world's leading photographers, writers, and graphic designers. This year Smolan invited more than 100 journalists around the globe to explore the world of Big Data. The Human Face of Big Data captures, in fascinating photographs and moving essays, an extraordinary revolution sweeping, almost invisibly, through business, academia, government, healthcare, and everyday life. Big Data is already enabling us to provide a healthier life for our children, to provide our seniors with independence while keeping them safe, to help us conserve precious resources like water and energy, to peer into our own individual genetic makeup, to create new forms of life, and soon, many predict, to reengineer our own species... and we've barely scratched the surface.
Hadoop: Thinking Big
Most organizations have limited their thinking about Hadoop. The use cases they pursue are narrow and have only scratched the surface on how to best improve business results and gain a competitive edge.
The truth is that there are just a few obstacles to overcome and a few changes in perspective, to realizing the full potential for Hadoop.
This session will provide insights into how the combination of scale, efficiency, and analytic flexibility creates the power to expand the applications for Hadoop to transform companies as well as entire industries.
This keynote is sponsored by MapR Technologies
Hadoop started as an offline, batch-processing system. It made it practical to store and process much larger datasets than before. Subsequently, more interactive, online systems emerged, integrating with Hadoop. First among these was HBase, the key/value store. Now scalable interactive query engines are beginning to join the Hadoop ecosystem. Realtime is gradually becoming a viable peer to batch in big data.
Cloud, Mobile and Big Data -- How Analytics Provides Value to the Buzzwords
In this rapid-fire keynote, we'll introduce how virtually every new technology trend is inextricably linked -- or should be to attain maximum leverage. We'll discuss how you can ride the Big Data wave by leveraging analytics to drive superior and faster decisions -- decisions that can lead to competitive advantage. We'll discuss how you can use technologies such as cloud and mobility to spread the value of analytics pervasively across your virtual organization, and how that positively impacts your employees, customers and partners.
This keynote is sponsored by SAS
From Traditional Database to Big Data Platform
You need more than a database 'hammer' for today's Big Data projects. Organizations need a 'data platform' providing integrated tools to capture, store, process and present data. Without it companies can achieve -- volume, velocity, or variety -- but not all three. Join us to learn the extreme capabilities needed to distill new business signals from big data.
This keynote is sponsored by SAP
Are We Really Winning the Information Revolution?
Samantha Ravich, former National Security Advisor to Vice President Richard Cheney, will discuss the challenges that face strategic decision makers from the wealth of data now provided by advances in technology.
Of Rocket Ships and Washing Machines: Data Technology for People
The story of Big Data technology has centered on engines, algorithms, and statistical methods for data analysis. Less has been said -and too little has been done-regarding technology to improve the lives of data analysts. In this talk I'll highlight recent research from Berkeley and Stanford targeted at improving productivity across the data lifecycle, using technology to address the scarcest resource in Big Data: people.