Interpreting business and social data can help us understand a huge range of subjects including political impact, environmental concerns, investment decisions, ethical issues, and improving sales processes. Previously the time and energy required to collect and interpret this data was beyond the reach of most individuals and even corporations found it difficult to justify the investment in a data-driven approach.
There are now many sources of cool data from the government, non-profit organizations and even companies themselves which can be downloaded, combined and mined. Unfortunately, these data sources aren’t well advertised and can be hard to interpret. In this talk I aim to give you some ideas of what’s out there and some ideas of how you can use it to do interesting things.
- Politicians and their relationships with various companies
- Data from SEC filings, NAICS codes, and the US Census Bureau
- Automatically finding the “buzz” about different companies
- Ways to visualize different sources to know the right questions to ask
- Generating implied social networks
- Specialized data from other agencies like the FDA and the USPTO
- Combining data from many sources to learn new things
Toby Segaran is the author of the O’Reilly title, “Programming Collective Intelligence”, Amazon’s top-selling AI book, and the Data Magnate at Metaweb Technologies. Prior to Metaweb he founded and sold a biotechnology software company. He loves applying data-mining algorithms to everything ranging from pharmaceutical trials to bloggers and online dating.
Have a suggestion for a speaker or topic at Web 2.0 Expo San Francisco? Send an email to: firstname.lastname@example.org
View a complete list of Web 2.0 Expo contacts.