Loading...
Answers
MenuAthena, redshift, bigquery?
Answers
Hello, I'm Muntasir Al Qawasmi
I would like to say, in general terms, that the use of what was mentioned, Google and Amazon are quick and innovative solutions, and if used, it is for backup copies.
But in the case of a large company and a large number of daily data, I recommend creating a special program using Python that reads the data and delivers as appropriate for your business policy
I have been helping clients design, build, and deploy data platforms for many years. Initially on premise, but now in the cloud. I think your choice of technology is less important than your chosen data architecture. I would focus on defining this data architecture first (i.e. to satisfy your requirements) and then looking at how the Azure, AWS, or GCP clouds could best satisfy your requirement. I would consider things like data ingestion, orchestration, modelling, and exploitation. The biggest challenges won't be technology, but thinkings like 1) how do you model the data to answer your questions and 2) how do you relate/join the various data sources. I hope this helps; feel free to pop in a call of you would like to delve into more detail or if you have any questions.
Related Questions
-
How do I set up a WordPress MU site using multiple databases? The site will become very large if it's all on one database.
Check out these WordPress plugins: https://wordpress.org/plugins/shardb/ and https://premium.wpmudev.org/project/multi-db/ Good luck!BB
-
How can I know when to go with my gut or when to trust the data when making a business decision?
In my experience coaching hundreds of people, anytime you go against your gut (aka intuition) you will lose. Data is important to consider, but your gut often knows the path.RC
-
How can I aggregate data from online sources about a specific topic?
There are so many ways to do it... Do you need this data for yourself, or you are planning to make a product around it? From what I see you can use Twitter API and Facebook Graph API (Are you comfortable programming?) Most of the students are active on social media so you will find lots of data. Facebook graph API will give you a number of likes and comments to all the posts of you competitors. You can analyze all the posts of your competitors. Using Twitter API you can get all the twits that use certain hashtags or mentions. If you are not into coding, but still want to get social media information, you can take a look at tools like IBM Watson ANalytics ($30 for personal use), it natively connects to Twitter API, and you don't have to be a programmer at all. It is intuitive and easy to learn. Analytics Canvas connects to Facebook Graph API (it's free for 30 days of trial). Unfortunately, you would not be able to collect any personal information from social media at large scale (age, income, gender, etc.), because it violates all the laws about privacy on the Internet. You can use census data instead. Google Sheets are a very handy tool if you are planning to use this information for personal research. You can set up a spreadsheet and add some Java script to make it collect all information from competitor's blogs, and also sites like Reddit. Finally, you can try web scraping (it's not the best, but can speed up the process). A tool like OutWitHub will collect information from websites (such as website reviews) based on the structure you provide (select html tags). You can collect thousands of reviews in one day if you automate it (paid version). Very easy to use. Note: not all the websites are open to this method, review their policies to make sure you are not violating their terms of service. Reviews belong to the website where they were published. If you REALLY need personal data (like how much they earn and how much they spend, etc.), just print out 100 questionnaires and go to Student Union Building of Dalhousie University. Most of the students will share any personal data in exchange for a Tim Horton's gift card that gets them a free coffee. It is probably the least technical and fastest way to get all the data you need. Hope this helps.OT
-
How do I find the Total Addressable Market for a Big Data product like Datameer?
As someone who has built TAM models for various industries - I can attest this isn;t always easy. Many analyst firms (former analyst here) typically do some surveys and research among public company revenue reports for the year and then do some pixie dust extrapolating. The hard thing about a market where startups like Datameer play is that it is a) a nascent market and b) not a "zero sum" game. This makes it difficult to a) fully understand WHICH companies actually are in the market for this flavor of BI and b) know how the space is developing in terms of white space AND disrupting the entrenched BI giants. You might want to look at the press published versions (read: free) of analyst market share reports, and extrapolate from their. Also, back-channel gossip may give you some revenue numbers on these players - and if you assume pipeline is 3-4x revenue, build a model from there. Hope this helps.MS
-
Is there demand for a service that sells database of contact information of varied sectors and geographies? If so, how do I market this product?
There are a lot of players in this market. Many that integrate with all the major CRMs and append and clean data. I would say I get at least 1 email a day from a data vendor and I have been on countless demos and calls and because my client is so niche they never have more 5-10 records even viable for me. I am sure for larger companies with broader markets this works well. They all tend to have very aggressive sales outreach. I have one company that has called me every 3 months for the past 2 years. At this point hes basically just asking about my health and the weather :)NP
the startups.com platform
Copyright © 2025 Startups.com. All rights reserved.