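1. Sooner or later you have to realize that profit is not the only purpose; making money is only a means. If you want to build a great company, you need a real purpose, one that makes the world a better place.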

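2. Government lags far behind the private sector in information technology. There is a great deal we can do to help government build better, more humane public services, a more complete pension system, a fairer justice system, prevention of disease and crime, and forecasting of economic trends.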

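3. When it comes to open access to information, we need to publish not only research results but also the data behind them, so that others can reproduce and validate those results.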

Tim O’Reilly, CEO of O’Reilly Media, one of Forbes’ top 7 data scientists in the world, and a Silicon Valley visionary who coined the term Web 2.0, answered questions from Jake Porway (DataKind) about using data science for social good. Read on for his thought-provoking responses.
1. How can we ensure that nonprofits have access to the same data science resources most big companies take for granted?
More companies should follow the lead of Planet Labs (planet.com) and set up a corresponding .org (planet.org) early in their development.  It's never too early to remember that profit is not our only purpose.  In fact, profit is a means, not an end.  If we want to build a great company, we need a real purpose, and that means doing things that make the world a better place.
2. What cause or issue area are you passionate about where you think data science could make a big impact?
Improving government services. There is so much potential in technology to build services that make people's lives better. When government falls so far behind the private sector, people lose faith in it. Yet it is the one institution that is supposed to work for all of us. So we need to help make it so. At Code for America, we're working in areas such as access to social services, criminal justice (e.g. data science to predict which people are the best candidates for alternatives to incarceration), health, and so on.
3. What are your favorite examples of data used for the greater good?
The whole enterprise of science is data for the greater good. Which is why open access to scientific research is so important. And open access needs to include not just access to the scientific literature, but also to the underlying data, so that research can be reproduced and validated by others. It's time to bring scientific publishing into the 21st century. I'm also really fond of the way that Civis Analytics, a startup created by the data team from the 2012 Obama campaign, is using "get out the vote" technology for other social problems, like getting people into healthcare, or getting smart but poor kids to apply to better schools. And I think that the work that Nathan Wolfe at Metabiota and its associated non-profit, the Global Viral Forecasting Institute, are doing to build a data-driven, 21st-century equivalent to the CDC is super important in this age of new infectious disease outbreaks, such as we are seeing with the current Ebola outbreak in West Africa. And finally, while I've said above that government is a laggard in many technology areas, it's important to remember that without government weather and GPS satellites, many contemporary consumer services wouldn't exist, and our agricultural sector would be far less productive.

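Big data covers many scenarios. Common ones include ETL (extract, transform, load), real-time streaming (monitoring and alerting, risk control, and so on), machine learning (recommendation engines, user profiling, and so on), unstructured data analysis (video, images, speech, text, and so on), online storage of massive data sets (HBase), search, and OLAP, the subject of this article. Of these, OLAP (online analytical processing) accounts for the bulk of analytical workloads at many companies.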
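1. How are cloud computing and big data related? The key word for cloud computing is "integration". Whether it uses the mature, traditional approach of partitioning machines into virtual machines or the approach Google later adopted of aggregating a massive number of nodes, cloud computing pools a massive amount of server resources over the network, then schedules and allocates them to users, solving the problems users face when they lack storage and compute resources.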
