Dear AVian,
2015 was year of growth for us – we transformed from a blog to a community of data scientists. We launched our discussion portals, hackathon platform and job portal – and each of this initiative is shaping up well. Our mission of building a nation of data scientists is coming live every day. And all of this happened only because of your enthusiasm and support. We couldn’t have thought of this journey without direction from you. Your comments, emails, letters and queries helps us decide our next steps.
So, on this day of new year, I want to hear more from you. I want to hear from you what bothers you the most about data science? What aspects of data science you find hard to learn? What is blocking your successful journey? What is one thing you would want us to do right in 2016?
Simply reply to this email on write in comments below.
I don’t know how things will turn out. But, I promise, I’ll try my best to give you many reasons of happiness & joy in 2016.
Wishing you all a year full of joy, health, curiosity, knowledge and perseverance.
I’m waiting for your reply.
Regards
Kunal Jain
Founder & CEO
Analytics Vidhya
Kunal Jain is the Founder and CEO of Analytics Vidhya, one of the world's leading communities of AI professionals.
With over 17 years of experience in the field, Kunal has been instrumental in shaping the global AI landscape. His expertise spans diverse markets, from developed economies like the UK to emerging ones like India, where he has successfully led and delivered complex data-driven solutions. As a recognized thought leader, Kunal has empowered countless individuals to realize their AI ambitions through his visionary approach to AI education and community building.
Before founding Analytics Vidhya, Kunal earned both his undergraduate and postgraduate degrees from IIT Bombay and held key roles at Capital One and Aviva Life Insurance across multiple geographies. His passion lies at the intersection of analytics, AI, and fostering a thriving community of data science professionals.
It’s our 3rd Birthday – Come &...
AVturns2: Let the celebrations begin!
Analytics Vidhya turns 4 – A journey from...
Improvising Hackathon platform, Blogathon, Prof...
Our new section – Stories and Why I am su...
5 years of building Analytics Vidhya – ou...
Excitement going up at Analytics Vidhya (and I ...
Welcome 2015 with new, better and more helpful ...
Welcome 2017 – Are you prepared for a yea...
Highlights of 2013
We use cookies essential for this site to function well. Please click to help us improve its usefulness with additional cookies. Learn about our use of cookies in our Privacy Policy & Cookies Policy.
Show details
This site uses cookies to ensure that you get the best experience possible. To learn more about how we use cookies, please refer to our Privacy Policy & Cookies Policy.
It is needed for personalizing the website.
Expiry: Session
Type: HTTP
This cookie is used to prevent Cross-site request forgery (often abbreviated as CSRF) attacks of the website
Expiry: Session
Type: HTTPS
Preserves the login/logout state of users across the whole site.
Expiry: Session
Type: HTTPS
Preserves users' states across page requests.
Expiry: Session
Type: HTTPS
Google One-Tap login adds this g_state cookie to set the user status on how they interact with the One-Tap modal.
Expiry: 365 days
Type: HTTP
Used by Microsoft Clarity, to store and track visits across websites.
Expiry: 1 Year
Type: HTTP
Used by Microsoft Clarity, Persists the Clarity User ID and preferences, unique to that site, on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
Expiry: 1 Year
Type: HTTP
Used by Microsoft Clarity, Connects multiple page views by a user into a single Clarity session recording.
Expiry: 1 Day
Type: HTTP
Collects user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.
Expiry: 2 Years
Type: HTTP
Use to measure the use of the website for internal analytics
Expiry: 1 Years
Type: HTTP
The cookie is set by embedded Microsoft Clarity scripts. The purpose of this cookie is for heatmap and session recording.
Expiry: 1 Year
Type: HTTP
Collected user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.
Expiry: 2 Months
Type: HTTP
This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number of visitors, the source where they have come from, and the pages visited in an anonymous form.
Expiry: 399 Days
Type: HTTP
Used by Google Analytics, to store and count pageviews.
Expiry: 399 Days
Type: HTTP
Used by Google Analytics to collect data on the number of times a user has visited the website as well as dates for the first and most recent visit.
Expiry: 1 Day
Type: HTTP
Used to send data to Google Analytics about the visitor's device and behavior. Tracks the visitor across devices and marketing channels.
Expiry: Session
Type: PIXEL
cookies ensure that requests within a browsing session are made by the user, and not by other sites.
Expiry: 6 Months
Type: HTTP
use the cookie when customers want to make a referral from their gmail contacts; it helps auth the gmail account.
Expiry: 2 Years
Type: HTTP
This cookie is set by DoubleClick (which is owned by Google) to determine if the website visitor's browser supports cookies.
Expiry: 1 Year
Type: HTTP
this is used to send push notification using webengage.
Expiry: 1 Year
Type: HTTP
used by webenage to track auth of webenagage.
Expiry: Session
Type: HTTP
Linkedin sets this cookie to registers statistical data on users' behavior on the website for internal analytics.
Expiry: 1 Day
Type: HTTP
Use to maintain an anonymous user session by the server.
Expiry: 1 Year
Type: HTTP
Used as part of the LinkedIn Remember Me feature and is set when a user clicks Remember Me on the device to make it easier for him or her to sign in to that device.
Expiry: 1 Year
Type: HTTP
Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries.
Expiry: 6 Months
Type: HTTP
Used to store information about the time a sync with the AnalyticsSyncHistory cookie took place for users in the Designated Countries.
Expiry: 6 Months
Type: HTTP
Cookie used for Sign-in with Linkedin and/or to allow for the Linkedin follow feature.
Expiry: 6 Months
Type: HTTP
allow for the Linkedin follow feature.
Expiry: 1 Year
Type: HTTP
often used to identify you, including your name, interests, and previous activity.
Expiry: 2 Months
Type: HTTP
Tracks the time that the previous page took to load
Expiry: Session
Type: HTTP
Used to remember a user's language setting to ensure LinkedIn.com displays in the language selected by the user in their settings
Expiry: Session
Type: HTTP
Tracks percent of page viewed
Expiry: Session
Type: HTTP
Indicates the start of a session for Adobe Experience Cloud
Expiry: Session
Type: HTTP
Provides page name value (URL) for use by Adobe Analytics
Expiry: Session
Type: HTTP
Used to retain and fetch time since last visit in Adobe Analytics
Expiry: 6 Months
Type: HTTP
Remembers a user's display preference/theme setting
Expiry: 6 Months
Type: HTTP
Remembers which users have updated their display / theme preferences
Expiry: 6 Months
Type: HTTP
Used by Google Adsense, to store and track conversions.
Expiry: 3 Months
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
Save certain preferences, for example the number of search results per page or activation of the SafeSearch Filter. Adjusts the ads that appear in Google Search.
Expiry: 2 Years
Type: HTTP
These cookies are used for the purpose of targeted advertising.
Expiry: 6 Hours
Type: HTTP
These cookies are used for the purpose of targeted advertising.
Expiry: 1 Month
Type: HTTP
These cookies are used to gather website statistics, and track conversion rates.
Expiry: 1 Month
Type: HTTP
Aggregate analysis of website visitors
Expiry: 6 Months
Type: HTTP
This cookie is set by Facebook to deliver advertisements when they are on Facebook or a digital platform powered by Facebook advertising after visiting this website.
Expiry: 4 Months
Type: HTTP
Contains a unique browser and user ID, used for targeted advertising.
Expiry: 2 Months
Type: HTTP
Used by LinkedIn to track the use of embedded services.
Expiry: 1 Year
Type: HTTP
Used by LinkedIn for tracking the use of embedded services.
Expiry: 1 Day
Type: HTTP
Used by LinkedIn to track the use of embedded services.
Expiry: 6 Months
Type: HTTP
Use these cookies to assign a unique ID when users visit a website.
Expiry: 6 Months
Type: HTTP
These cookies are set by LinkedIn for advertising purposes, including: tracking visitors so that more relevant ads can be presented, allowing users to use the 'Apply with LinkedIn' or the 'Sign-in with LinkedIn' functions, collecting information about how visitors use the site, etc.
Expiry: 6 Months
Type: HTTP
Used to make a probabilistic match of a user's identity outside the Designated Countries
Expiry: 90 Days
Type: HTTP
Used to collect information for analytics purposes.
Expiry: 1 year
Type: HTTP
Used to store session ID for a users session to ensure that clicks from adverts on the Bing search engine are verified for reporting purposes and for personalisation
Expiry: 1 Day
Type: HTTP
Cookie declaration last updated on 24/03/2023 by Analytics Vidhya.
Cookies are small text files that can be used by websites to make a user's experience more efficient. The law states that we can store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses different types of cookies. Some cookies are placed by third-party services that appear on our pages. Learn more about who we are, how you can contact us, and how we process personal data in our Privacy Policy.
Edit
Resend OTP
Resend OTP in 45s
Dear Kunal, Happy New Year and an even more successful 2016. Keep up the good work. I am currently in Bombay and will be heading back to USA on the 5th. Pl. let me k now if you are travelling around this are in the next 2 days. Best Regards, Srikar 202-445-0183
Srikar, Looks like I'll miss you. I plan to be in Mumbai a couple of weeks down the line. All the best for your trip. Regards, Kunal
The biggest problem I have had is access to realistic data. I would like to see a repository of "scan data", mfg rep sales data, etc., and other types of data sets for analysis and feedback from other Data Analysts on how they approached the problem. It is extremely difficult to get good realistic data as companies jealously guard their data and trade secrets.
Alvin, This sounds a lot like our hackathons. Do check them out - http://datahack.analyticsvidhya.com We release data after working with our clients for these hackathons and the community usually shares their solutions on our discussion platform. Hope to see you competing in our hackathons this year. Regards, Kunal
Dear kunal HAPPY NEW YEAR Regards. kishore
Thanks Kishore. Same to you.
I am totally new in analytics but i got almost 10 years of experince in stock market,i want to build some stock price prediction model can we do that
Sagar, We are working on creating a problem on these lines. Regards, Kunal
Happy new year Kunal
Happy New Year to you too Golam Kabir.
Hi Kunal, Wish you and your team a very happy new year. All around the web we only find analysis on small sample of available datasets. It would be a great idea for a section on analyzing extremely large datasets (million records strong). I have learned that a completely different approach is required with such large datasets, as you not only focus on the algorithm, but also on the data, speed and efficiency of the algorithm used. I am looking around for more guidance on the subject, and would appreciate if AV could do a series on this. Regards, Jigar
Thanks Jigar for the suggestion. We will definitely put up a series on this. In the meanwhile, do look out for our hackathon datasets - they are reasonably big and can help you get flavour of big data sets. Regards, Kunal
Happy new year Kuna ! Keep going on what you're doing. You do a very good job. It help me a lot in my work and in my personal development. I just hope that you can create something like learning paths but devoted to more theory in order to "unblackbox" statistical tools like random forests for example.
Amine, Thanks for your suggestion. We will definitely work on it. Regards, Kunal
Hello Kunar, First of all thanks to you and your friends for this usefull site. I wish you a great 2016. In my experience is one of a kind, and you and Andrew Ng (I'm quite sure you heard of him) are my personal heroes. So a deep thanks to you and Andrew for sharing your deep knowledge. My personal barrier to learn is to find someone giving me credits enough to work for him on real task. I tried also to offer my services for free, but I was unlucky. So my target for next year is to read all your articles, and try to participate in a competition, hardening myself until I will find a job in this industry. Gianfranco from Italy wishes to all friends here a very good 2016 !
Dear Gianfranco, Thanks for those wonderful wishes. Prof. Andrew Ng has made huge contribution to the field. While it is flattering to be mentioned along side him, I have a long way to go before I can come any where close to the contributions he has made. Looking forward to interact with you further in 2016. Regards, Kunal
Hi Kunal, Happy New year. You are doing a great job and I would like to thank you for starting AV and actively working on improving the platform. I would like to suggest you something. I've an intermediate level of knowledge in data science and big data technologies. I used to participate in some Kaggle competitions also. But Hackathon and competitions are platforms that focus on model building and optimizing. I believe that in real projects, data collection and feature engineering steps have importance. I wish if there were some opensource projects or platforms were people can collaboratively work on solving real world issues. For example exploring data provided by government, NGOs, Survey platforms, etc. and developing a good insights from it. I wish you even more successful 2016. Thanks & Regards, Niyas
Thanks Niyas for the suggestion. We will definitely work on it. Regards, Kunal
Happy new year 2016! The best wishes for everybody. I would like to read more about multivariate data analysis, specific for QlikView. Thank you Hv
Thanks Hector. Happy new year to you too. Will make sure we address this need. Regards, Kunal
Hi Kunal, Wish you happy new year!!! Currently I m working as qlikview developer. I want to move to analytics. Kindly help me further learning. Regards Viresh
Viresh, Happy New Year to you too. You can start by following our learning path on R or Python. Regards, Kunal
You are really doing a good job, now I recommend to start guiding members toward main certificates in big data and build skills matrix for big data to help employers. I think certificate section may be a good addition to your web site and build a window for those who got these certificates to tell us about their experience before and after getting it.
Hossam, Good suggestion. It could be a good fit with 2016 plan. Regards, Kunal
Hi Kunal, Two areas where I would like help are - 1) Finding real-time work - working in Kaggle and other competitions are fine but it would be of great help if we can get to work (under any Data Scientist) on real time projects. This will increase our learning scope tremendously. 2) Exploring and Explaining (top) successful solutions in different competitions (Kaggle/AV/CrowdAnalytix etc). Discussing various aspects like - what feature engineering was done which differentiated top rankers from others, what algorithms did top rankers used etc. Along with the learning paths your team is publishing, if you can write few posts on above topics, it would be of immense help to newbies trying to swim through the ocean of Analytis :-)
Highspirits, 1. Good thought - it might be a bit difficult to channelize something like this. But we have a few thoughts to test out in 2016. 2. We have done a few articles on this line and would continue to do so. On our hackathons, people usually share their approach on the discussion portals - so look out for them. Have an awesome 2016. Regards, Kunal
Hi Kunal, Your articles on Analytics are always informative and helpful. Above all your committment to help fellow Data Analysts and Data Scientists is amazing. We always look forward to what comes from you. Keep the good work going and we wish you a wonderful New Year 2016. Really appreciate if you could clarify the following questions. Q1) What tools do you suggest for text mining preferbly any tools that require no or little scripting? Q2) I am working on a project where we have 600,000 plus records in MySql database table. I need to search for a string of two words in the column2 (long text). Then pick 8 character numeric that follows the two word search string and insert both 8 character numeric from column2 and ID from column1 into a separate table..Please suggest what is the best fit tool for this requirement? If you have any code snippet please share. Thank you as always, Trinadh
Trinadh, Best place to start looking for 1 is 'tm' library in R or 'nltk' library in Python. For the second question - please post it on our discussion portal: http://discuss.analyticsvidhya.com along with a few lined of data and what you want. I am sure some one in the community will help you out. Regards. Kunal
Hi Kunal, Wish u a happy new year.. hope this year will be the fantastic year for you and Analytics vidhya Community. Regards, Vidya
Happy New Year Kunal, First things I want to say thanks to Analytics Vidhya, I learn a lot in 2015 from blogs, articles, and participating in hackathon. I hope our learning will be continue in 2016 also. Thanks and Regard Uday Singh