Loading...
Answers
MenuHow big data can evolve if our overall ability to convert data into really valuable info that then can drive effective decision making is so limited?
This question has no further details.
Answers
I would disagree with the premise that "big data" isn't being converted into valuable information. Modern advertising is being driven by exactly this ability. The problem lies not with the information, or lack thereof, but with the decision makers. Lack of vision at both the strategic and tactical levels of decision, fear of change, among other things are at the root of the problem.
I also agree. It really depends on what you do with the data. Even a small bit of data can mean a lot.
The other problem is that the people gathering the data and analyzing it are sometimes people with PhDs and not MBAs. We are talking about a very technical thing here and the industry is growing. Just give it some time and I guarantee it'll evolve. So how you ask? Time, that's how. Trial and error. The risk takers will lead the way.
To be honest, I do believe many people are wasting too much money on huge data warehouses. I firmly believe we can get the data we need to act on without going so crazy. We're all too impressed with size. I mean, maybe if you're trying to clone Dolly or are the NSA or something, you can't avoid all that data...But size doesn't matter. Results do.
For example, a 5 terabyte data set of Twitter messages from random people no one cares about talking about what they ate for breakfast? Useless.
I know, it's unfair...We've spent millions of dollars learning about breakfast foods from Twitter and Tony the Tiger's family is still starving. Yup, it happens.
Just give it time. Big data and data mining on the web is still in its infancy in terms of application.
I am not at all convinced that our ability to convert data to actionable decision making metrics is limited.
In fact, I do just that every day.
There are definitely limiting factors, but the ability to make use of the data is much stronger than people commonly believe.
For structured data you just need a good understanding of the data structures and their relationships. Unstructured data (tweets, statuses, etc.) is much more difficult but there are several technologies available and under development that tackle unstructured data specifically. This even includes some high end tools that monitor voice data.
If you have a specific data scenario in mind I can be more specific. If you'd like to arrange a call I'd be happy to provide more detail.
Related Questions
-
What is the viability of big data in education? Is learning analytics a potentially viable market considering current restraints on public education?
Qualifications on the answer: Created the first website to be ever commercially licensed by a ministry of education for in-school use (Brainium 1996). Was a VC associate at a 500,000,000 firm called Knowledge Universe focused on all things education-related that intersected with technology. Short Answer: Yes and no. Longer Answer: Learning analytics is one of those "lightening in a bottle" kind of industries. Everyone knows it's going to happen, whomever makes it happen at scale first, will have a huge advantage, and even more niche plays in this space will still create massive value for shareholders and significant transformation of education as we know it. That's the good news. The problem is that funding startups in the education space that are dependent on "permission" being granted by existing institutions and their employees is very much hit or miss. It's getting easier to raise a $500-800k seed round for a good idea with a good team, but the problem is that getting the next bigger round is very challenging in that the models of most of these seed-funded companies are not growing fast enough to prove to institutional investors that a growth model of the idea being very big in a relatively short time-period is credible. So if you're intent on pursuing this, I'd be sure to focus entirely on how you can get growth and adoption with requiring little to no institutional buy-in which sounds almost impossible on the surface of the area you're exploring but I believe isn't actually impossible. I'm very passionate about this space and want something like what you're describing to succeed so would be happy to do a quick call to hear where you're at and see if I can provide you some actionable ideas on how to reduce the friction / improve adoption of your analytics product.TW
-
How do I find the Total Addressable Market for a Big Data product like Datameer?
As someone who has built TAM models for various industries - I can attest this isn;t always easy. Many analyst firms (former analyst here) typically do some surveys and research among public company revenue reports for the year and then do some pixie dust extrapolating. The hard thing about a market where startups like Datameer play is that it is a) a nascent market and b) not a "zero sum" game. This makes it difficult to a) fully understand WHICH companies actually are in the market for this flavor of BI and b) know how the space is developing in terms of white space AND disrupting the entrenched BI giants. You might want to look at the press published versions (read: free) of analyst market share reports, and extrapolate from their. Also, back-channel gossip may give you some revenue numbers on these players - and if you assume pipeline is 3-4x revenue, build a model from there. Hope this helps.MS
-
How can I aggregate data from online sources about a specific topic?
There are so many ways to do it... Do you need this data for yourself, or you are planning to make a product around it? From what I see you can use Twitter API and Facebook Graph API (Are you comfortable programming?) Most of the students are active on social media so you will find lots of data. Facebook graph API will give you a number of likes and comments to all the posts of you competitors. You can analyze all the posts of your competitors. Using Twitter API you can get all the twits that use certain hashtags or mentions. If you are not into coding, but still want to get social media information, you can take a look at tools like IBM Watson ANalytics ($30 for personal use), it natively connects to Twitter API, and you don't have to be a programmer at all. It is intuitive and easy to learn. Analytics Canvas connects to Facebook Graph API (it's free for 30 days of trial). Unfortunately, you would not be able to collect any personal information from social media at large scale (age, income, gender, etc.), because it violates all the laws about privacy on the Internet. You can use census data instead. Google Sheets are a very handy tool if you are planning to use this information for personal research. You can set up a spreadsheet and add some Java script to make it collect all information from competitor's blogs, and also sites like Reddit. Finally, you can try web scraping (it's not the best, but can speed up the process). A tool like OutWitHub will collect information from websites (such as website reviews) based on the structure you provide (select html tags). You can collect thousands of reviews in one day if you automate it (paid version). Very easy to use. Note: not all the websites are open to this method, review their policies to make sure you are not violating their terms of service. Reviews belong to the website where they were published. If you REALLY need personal data (like how much they earn and how much they spend, etc.), just print out 100 questionnaires and go to Student Union Building of Dalhousie University. Most of the students will share any personal data in exchange for a Tim Horton's gift card that gets them a free coffee. It is probably the least technical and fastest way to get all the data you need. Hope this helps.OT
-
If you were to build a freelance marketplace for data scientists and data analysts, what kind of companies and projects would you target?
It's unlikely that companies would look to outsource such a critical component and also it would be near impossible to create trust around 3rd parties accessing their data especially via an intermediary service.TW
-
Do services social monitoring tools like Mention.com, Socialmention.com, and Nitrogram rely soley on APIs or do they crawl or cache the meta data?
It's hard to get a definite answer to this as these companies will not tell us how their algorithms work. Having said that: Based on my experience working with these APIs, it is totally possible to implement a real-time search like "socialmention.com" purely based on the APIs from Google, Twitter, Yahoo, or Facebook. On the other hand, crawling and saving all that data just for the eventuality that a customer might search for it is probably not economically viable and will also violate the Terms of Services of most of these APIs. Bottom line is: They are probably not crawling and caching.GF
the startups.com platform
Copyright © 2025 Startups.com. All rights reserved.