{"id":931,"date":"2018-10-16T04:44:10","date_gmt":"2018-10-16T01:44:10","guid":{"rendered":"http:\/\/kusuaks7\/?p=536"},"modified":"2021-05-11T14:00:28","modified_gmt":"2021-05-11T14:00:28","slug":"the-most-in-demand-skills-for-data-scientists","status":"publish","type":"post","link":"https:\/\/www.experfy.com\/blog\/bigdata-cloud\/the-most-in-demand-skills-for-data-scientists\/","title":{"rendered":"The Most in Demand Skills for Data Scientists"},"content":{"rendered":"<p><strong><em>Ready to learn Data Science? Browse&nbsp;<a href=\"https:\/\/www.experfy.com\/training\/tracks\/data-science-training-certification\">Data Science Training and Certification<\/a> courses developed by industry thought leaders and Experfy in Harvard Innovation Lab.<\/em><\/strong><\/p>\n<h4 id=\"fd66\" name=\"fd66\">What are employers looking&nbsp;for?<\/h4>\n<p id=\"ba33\" name=\"ba33\">Data scientists are expected to know a lot &mdash; machine learning, computer science, statistics, mathematics, data visualization, communication, and deep learning. Within those areas, there are dozens of languages, frameworks, and technologies data scientists could learn. How should data scientists who want to be in demand by employers spend their learning budget?<\/p>\n<p id=\"dca9\" name=\"dca9\">I scoured job listing websites to find which skills are most in demand for data scientists. I looked at general data science skills and at specific languages and tools separately. I searched job listings on&nbsp;<a data-href=\"https:\/\/www.linkedin.com\" href=\"https:\/\/www.linkedin.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">LinkedIn<\/a>,&nbsp;<a data-href=\"https:\/\/www.indeed.com\" href=\"https:\/\/www.indeed.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">Indeed<\/a>,&nbsp;<a data-href=\"https:\/\/www.simplyhired.com\" href=\"https:\/\/www.simplyhired.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">SimplyHired<\/a>,&nbsp;<a data-href=\"https:\/\/www.monster.com\" href=\"https:\/\/www.monster.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">Monster<\/a>, and&nbsp;<a data-href=\"https:\/\/angel.co\/jobs\" href=\"https:\/\/angel.co\/jobs\" rel=\"noopener noreferrer\" target=\"_blank\">AngelList<\/a>&nbsp;on October 10, 2018. Here&rsquo;s a chart showing how many data scientist jobs each website listed.<\/p>\n<figure id=\"16f0\" name=\"16f0\">\n<p><canvas height=\"46\" width=\"75\"><\/canvas><img decoding=\"async\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*3K7QnzBXI0Ys3NZgNRTezA.png\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*3K7QnzBXI0Ys3NZgNRTezA.png\" \/><\/p>\n<\/figure>\n<p id=\"3b06\" name=\"3b06\">I read through many job listings and surveys to find the most common skills. Terms like&nbsp;<em>management<\/em>&nbsp;were not compared because they can be used in so many different contexts in job listings.<\/p>\n<p id=\"0d46\" name=\"0d46\">All searches were performed for the United States with&nbsp;<em>&ldquo;data scientist&rdquo; &ldquo;[keyword]&rdquo;<\/em>. Using exact match search reduced the number of results. However, this method ensured the results were relevant for data scientist positions and affected all search terms similarly.<\/p>\n<p id=\"1106\" name=\"1106\">AngelList provides the number of companies with data scientist listings rather than the number of positions. I excluded AngelList from both analyses because its search algorithm seems to operate as an&nbsp;<em>OR&nbsp;<\/em>type of logical search, without the ability to change it to an&nbsp;<em>AND<\/em>. AngelList works fine if you are looking for&nbsp;<em>&ldquo;data scientist&rdquo; &ldquo;TensorFlow&rdquo;<\/em>&nbsp;which is only going to be found with data scientist positions, but if your keywords are &ldquo;<em>data scientist&rdquo; &ldquo;react.js&rdquo;<\/em>&nbsp;it returns far too many listings for companies with non-data scientist job listings.<\/p>\n<p id=\"bdfb\" name=\"bdfb\"><a data-href=\"https:\/\/www.glassdoor.com\/index.htm\" href=\"https:\/\/www.glassdoor.com\/index.htm\" rel=\"noopener noreferrer\" target=\"_blank\">Glassdoor<\/a>&nbsp;was also excluded from my analyses. The site stated that it had 26,263&nbsp;<em>&ldquo;data scientist&rdquo;<\/em>&nbsp;jobs in the US, but it would show me no more than 900 jobs. Additionally, it seems highly unlikely it would have more than three times the number of data scientist job listings as any other major platform.<\/p>\n<p id=\"ece2\" name=\"ece2\">Terms with over 400 listings on LinkedIn for general skills and over 200 listings for specific technologies were included in the final analyses. There was certainly some cross posting. The results are recorded in this&nbsp;<a data-href=\"https:\/\/docs.google.com\/spreadsheets\/d\/1df7QTgdAOItQJadLoMHlIZH3AsQ2j2_yoyvHOpsy9qU\/edit?usp=sharing\" href=\"https:\/\/docs.google.com\/spreadsheets\/d\/1df7QTgdAOItQJadLoMHlIZH3AsQ2j2_yoyvHOpsy9qU\/edit?usp=sharing\" rel=\"noopener noreferrer\" target=\"_blank\">Google Sheet<\/a>.<\/p>\n<p id=\"980e\" name=\"980e\">I downloaded&nbsp;.csv files and imported them into JupyterLab. I then computed the percentage occurrences and averaged them across the job listing websites.<\/p>\n<p id=\"9d70\" name=\"9d70\">I also compared the software results to a&nbsp;<a data-href=\"https:\/\/www.glassdoor.com\/research\/data-scientist-personas\/\" href=\"https:\/\/www.glassdoor.com\/research\/data-scientist-personas\/\" rel=\"noopener noreferrer\" target=\"_blank\">Glassdoor study<\/a>&nbsp;of its data scientist job listings from the first half of 2017. Combined with information from&nbsp;<a data-href=\"https:\/\/www.kdnuggets.com\/2018\/05\/poll-tools-analytics-data-science-machine-learning-results.html\/2\" href=\"https:\/\/www.kdnuggets.com\/2018\/05\/poll-tools-analytics-data-science-machine-learning-results.html\/2\" rel=\"noopener noreferrer\" target=\"_blank\">KDNuggets&rsquo; usage survey<\/a>, it appears some skills are becoming more important and others are losing importance. We&rsquo;ll get to those in a bit.<\/p>\n<p id=\"e825\" name=\"e825\">See my Kaggle Kernel for interactive charts and additional analyses&nbsp;<a data-href=\"https:\/\/www.kaggle.com\/discdiver\/the-most-in-demand-skills-for-data-scientists\/\" href=\"https:\/\/www.kaggle.com\/discdiver\/the-most-in-demand-skills-for-data-scientists\/\" rel=\"noopener noreferrer\" target=\"_blank\">here<\/a>. I used Plotly for the visualizations. To use Plotly with JupyterLab takes a little wrangling as of this writing &mdash; instructions are at the end of my Kaggle Kernel and in&nbsp;<a data-href=\"https:\/\/github.com\/plotly\/plotly.py\" href=\"https:\/\/github.com\/plotly\/plotly.py\" rel=\"noopener noreferrer\" target=\"_blank\">Plotly&rsquo;s docs<\/a>.<\/p>\n<h3 id=\"4af4\" name=\"4af4\">General Skills<\/h3>\n<p id=\"a8d0\" name=\"a8d0\">Here&rsquo;s the chart of the most frequent general data scientist skills sought by employers.<\/p>\n<figure id=\"c9fc\" name=\"c9fc\">\n<p><canvas height=\"46\" width=\"75\"><\/canvas><img decoding=\"async\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*-oG0j_wGSW_9cNNs4_qgFQ.png\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*-oG0j_wGSW_9cNNs4_qgFQ.png\" \/><\/p>\n<\/figure>\n<p id=\"c18b\" name=\"c18b\">The results show that analysis and machine learning are at the heart of data scientist jobs. Gleaning insights from data is a primary function of data science. Machine learning is all about creating systems to predict performance and it is very in demand.<\/p>\n<p id=\"9c49\" name=\"9c49\">Data science requires statistics and computer science skills &mdash; no surprise there. Statistics, computer science, and mathematics are also college majors, which probably helps their frequency.<\/p>\n<p id=\"629b\" name=\"629b\">It is interesting that communication is mentioned in nearly half of job listings. Data scientists need to be able communicate insights and work with others.<\/p>\n<p id=\"f0d7\" name=\"f0d7\">AI and deep learning don&rsquo;t show up as frequently as some other terms. However, they are subsets of machine learning. Deep learning is being used for more and more of the machine learning tasks that other algorithms were used for previously. For example, the best machine learning algorithms for most natural language processing problems are now deep learning algorithms. I expect deep learning skills will be sought more explicitly in the future and that machine learning will become more synonymous with deep learning.<\/p>\n<p id=\"8c10\" name=\"8c10\">Which specific software tools for data scientists are employers looking for? Let&rsquo;s tackle that question next.<\/p>\n<h3 id=\"abe0\" name=\"abe0\">Technology Skills<\/h3>\n<p id=\"c88b\" name=\"c88b\">Below are the top 20 specific languages, libraries, and tech tools employers are looking for data scientists to have experience with.<\/p>\n<figure id=\"4f00\" name=\"4f00\">\n<p><canvas height=\"46\" width=\"75\"><\/canvas><img decoding=\"async\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*jnZT4gFAzScOJ_VnYsni0g.png\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*jnZT4gFAzScOJ_VnYsni0g.png\" \/><\/p>\n<\/figure>\n<p id=\"7861\" name=\"7861\">Let&rsquo;s briefly look at the most common tech skills.<\/p>\n<figure id=\"aaaf\" name=\"aaaf\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*H1olKNHMeAiPbDoDf95MYw.png\" data-width=\"296\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*H1olKNHMeAiPbDoDf95MYw.png\" \/><\/p>\n<\/figure>\n<p id=\"07b3\" name=\"07b3\"><a data-href=\"https:\/\/www.python.org\/\" href=\"https:\/\/www.python.org\/\" rel=\"noopener noreferrer\" target=\"_blank\">Python<\/a>&nbsp;is the most in-demand language. The popularity of this open-source language has been widely observed. It&rsquo;s beginner friendly, with many support resources. The vast majority of new data science tools are compatible with it. Python is the primary language for data scientists.<\/p>\n<figure id=\"93b6\" name=\"93b6\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*KvDjHLuke7XW0EDXl4HObA.png\" data-width=\"129\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*KvDjHLuke7XW0EDXl4HObA.png\" \/><\/p>\n<\/figure>\n<p id=\"8420\" name=\"8420\">R is not far behind Python. It once was the primary language for data science. I was surprised to see how in demand it still is. The roots of this open source language are in statistics, and it&rsquo;s still very popular with statisticians.<\/p>\n<p id=\"a292\" name=\"a292\">Python or R is a must for virtually every data scientist position.<\/p>\n<figure id=\"7eda\" name=\"7eda\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*m0Ef1gHgIfsZ3ha4dRjMPw.png\" data-width=\"219\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*m0Ef1gHgIfsZ3ha4dRjMPw.png\" \/><\/p>\n<\/figure>\n<p id=\"1ce5\" name=\"1ce5\"><a data-href=\"https:\/\/en.wikipedia.org\/wiki\/SQL\" href=\"https:\/\/en.wikipedia.org\/wiki\/SQL\" rel=\"noopener noreferrer\" target=\"_blank\">SQL<\/a>&nbsp;is also in high demand. SQL stands for Structured Query Language and is the primary way to interact with relational databases. SQL is sometimes overlooked in the data science world, but it&rsquo;s a skill worth demonstrating mastery of if you&rsquo;re planning to hit the job market.<\/p>\n<figure data-scroll=\"native\" id=\"49b7\" name=\"49b7\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*A3CsEqNfW-hGQJcW_1aYnQ.png\" data-width=\"386\" src=\"https:\/\/cdn-images-1.medium.com\/max\/800\/1*A3CsEqNfW-hGQJcW_1aYnQ.png\" \/><\/p>\n<\/figure>\n<figure data-scroll=\"native\" id=\"c8f0\" name=\"c8f0\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*6B_ehX789jDsxzD3rvzI2w.png\" data-width=\"192\" src=\"https:\/\/cdn-images-1.medium.com\/max\/800\/1*6B_ehX789jDsxzD3rvzI2w.png\" \/><\/p>\n<\/figure>\n<p id=\"9d8e\" name=\"9d8e\">Up next are&nbsp;<a data-href=\"https:\/\/hadoop.apache.org\/\" href=\"https:\/\/hadoop.apache.org\/\" rel=\"noopener noreferrer\" target=\"_blank\">Hadoop<\/a>&nbsp;and&nbsp;<a data-href=\"https:\/\/spark.apache.org\/\" href=\"https:\/\/spark.apache.org\/\" rel=\"noopener noreferrer\" target=\"_blank\">Spark<\/a>, both open source tools from Apache for big data.<\/p>\n<blockquote id=\"2ec2\" name=\"2ec2\"><p>Apache Hadoop is an open source software platform for distributed storage and distributed processing of very large data sets on computer clusters built from commodity hardware.\u200a&mdash;<a data-href=\"https:\/\/hortonworks.com\/apache\/hadoop\/\" href=\"https:\/\/hortonworks.com\/apache\/hadoop\/\" rel=\"noopener noreferrer\" target=\"_blank\">\u200aSource<\/a>.<\/p><\/blockquote>\n<blockquote id=\"350e\" name=\"350e\"><p>Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.\u200a&mdash;\u200aS<a data-href=\"https:\/\/hortonworks.com\/apache\/spark\/\" href=\"https:\/\/hortonworks.com\/apache\/spark\/\" rel=\"noopener noreferrer\" target=\"_blank\">ource<\/a>.<\/p><\/blockquote>\n<p id=\"2222\" name=\"2222\">These tools have considerably less written about them on Medium and in tutorials than many others. I expect many fewer job candidates have these skills than Python, R, and SQL. If you have or can gain experience with Hadoop and Spark it should give you a leg up on the competition.<\/p>\n<figure id=\"9f90\" name=\"9f90\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*zhmi-0GKaelOE_ri5b09WA.jpeg\" data-width=\"163\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*zhmi-0GKaelOE_ri5b09WA.jpeg\" \/><\/p>\n<\/figure>\n<figure id=\"9da6\" name=\"9da6\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*GhlMX8eVpMhwMR9gYwLmSg.jpeg\" data-width=\"241\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*GhlMX8eVpMhwMR9gYwLmSg.jpeg\" \/><\/p>\n<\/figure>\n<p id=\"cfc6\" name=\"cfc6\">Then come&nbsp;<a data-href=\"https:\/\/www.java.com\/en\/\" href=\"https:\/\/www.java.com\/en\/\" rel=\"noopener noreferrer\" target=\"_blank\">Java<\/a>&nbsp;and&nbsp;<a data-href=\"https:\/\/www.sas.com\/en_us\/home.html\" href=\"https:\/\/www.sas.com\/en_us\/home.html\" rel=\"noopener noreferrer\" target=\"_blank\">SAS<\/a>. I was surprised to see these languages as high as they are. Both have large companies behind them, and at least some free offerings. Both Java and SAS generally receive little attention in the data science community.<\/p>\n<figure id=\"6dee\" name=\"6dee\">\n<p><img decoding=\"async\" data-height=\"100\" data-image-id=\"1*F9R8s7SvAozrxu81-n1ynA.png\" data-width=\"481\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*F9R8s7SvAozrxu81-n1ynA.png\" \/><\/p>\n<\/figure>\n<p id=\"e787\" name=\"e787\"><a data-href=\"https:\/\/www.tableau.com\/\" href=\"https:\/\/www.tableau.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">Tableau<\/a>&nbsp;is next in demand. This analytics platform and visualization tool is powerful, easy to use, and growing in popularity. It has a free public version, but will cost you money if you want to keep your data private.<\/p>\n<p id=\"f27e\" name=\"f27e\">If you aren&rsquo;t familiar with Tableau, it&rsquo;s definitely worth taking a quick class such as&nbsp;<a data-href=\"https:\/\/www.udemy.com\/tableau10\/\" href=\"https:\/\/www.udemy.com\/tableau10\/\" rel=\"noopener noreferrer\" target=\"_blank\">Tableau 10 A-Z<\/a>&nbsp;on Udemy. I don&rsquo;t get a commission for the suggestion&mdash; I just took the class and found it to be a great value.<\/p>\n<p id=\"8ee8\" name=\"8ee8\">The chart below shows an even bigger list of the most in demand languages, frameworks, and other data science software tools.<\/p>\n<figure id=\"27c0\" name=\"27c0\">\n<p><canvas height=\"46\" width=\"75\"><\/canvas><img decoding=\"async\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*ms7fwYHNdXuGQ_0vKINkNQ.png\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*ms7fwYHNdXuGQ_0vKINkNQ.png\" \/><\/p>\n<\/figure>\n<h4 id=\"2fe5\" name=\"2fe5\">Historical Comparison<\/h4>\n<p id=\"b71e\" name=\"b71e\">GlassDoor did an&nbsp;<a data-href=\"https:\/\/www.glassdoor.com\/research\/data-scientist-personas\/\" href=\"https:\/\/www.glassdoor.com\/research\/data-scientist-personas\/\" rel=\"noopener noreferrer\" target=\"_blank\">analysis<\/a>&nbsp;of the 10 most common software skills for data scientists from January 2017 through July 2017 on their site. Here&rsquo;s a comparison of how frequently the terms appeared on their site compared to the average on LinkedIn, Indeed, SimplyHired, and Monster in October 2018.<\/p>\n<figure id=\"6b65\" name=\"6b65\">\n<p><canvas height=\"46\" width=\"75\"><\/canvas><img decoding=\"async\" data-src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*iueZKOOBidZtr-FTYyf6QA.png\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*iueZKOOBidZtr-FTYyf6QA.png\" \/><\/p>\n<\/figure>\n<p id=\"47c2\" name=\"47c2\">The results are fairly similar. Both my analysis and GlassDoor&rsquo;s found Python, R, and SQL to be the most in demand. We also found the same top nine technology skills, albeit in slightly different orders.<\/p>\n<p id=\"eff7\" name=\"eff7\">The results suggest that compared to the first half of 2017, R, Hadoop, Java, SAS, and MatLab are now less in demand and Tableau is more in demand. This is what I would expect given the complementary results from sources such as the&nbsp;<a data-href=\"https:\/\/www.kdnuggets.com\/2018\/05\/poll-tools-analytics-data-science-machine-learning-results.html\/2\" href=\"https:\/\/www.kdnuggets.com\/2018\/05\/poll-tools-analytics-data-science-machine-learning-results.html\/2\" rel=\"noopener noreferrer\" target=\"_blank\">KDnuggets developer survey<\/a>. There, R, Hadoop, Java, and SAS all show clear multi-year downward usage trends and Tableau shows a clear upward trend.<\/p>\n<h3 id=\"8588\" name=\"8588\">Recommendations<\/h3>\n<p id=\"bcc3\" name=\"bcc3\">Based on the results of these analyses, here are some general recommendations for current and aspiring data scientists concerned with making themselves widely marketable.<\/p>\n<ul>\n<li id=\"ae89\" name=\"ae89\">Demonstrate you can do data analysis and focus on becoming really skilled at machine learning.<\/li>\n<li id=\"6fd2\" name=\"6fd2\">Invest in your communication skills. I recommend reading the book&nbsp;<a data-href=\"https:\/\/www.amazon.com\/Made-Stick-Ideas-Survive-Others\/dp\/1400064287\" href=\"https:\/\/www.amazon.com\/Made-Stick-Ideas-Survive-Others\/dp\/1400064287\" rel=\"noopener noreferrer\" target=\"_blank\">Made to Stick<\/a>&nbsp;to help your ideas have more impact. Also check out the&nbsp;<a data-href=\"http:\/\/www.hemingwayapp.com\/\" href=\"http:\/\/www.hemingwayapp.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">Hemmingway Editor<\/a>&nbsp;app to improve the clarity of your writing.<\/li>\n<li id=\"20b5\" name=\"20b5\">Master a deep learning framework. Being proficient with a deep learning framework is a larger and larger part of being proficient with machine learning.&nbsp;<\/li>\n<li id=\"e1a2\" name=\"e1a2\">If you are choosing between learning Python and R, choose Python. If you have Python down cold, consider learning R. You&rsquo;ll definitely be more marketable if you also know R.<\/li>\n<\/ul>\n<p id=\"bffd\" name=\"bffd\">When an employer is looking for a data scientist with Python skills, they are also likely to expect candidates to know the common python data science libraries: numpy, pandas, scikit-learn, and matplotlib. If you&rsquo;re looking to learn this set of tools, I suggest the following resources:<\/p>\n<ul>\n<li id=\"9e3b\" name=\"9e3b\"><a data-href=\"https:\/\/www.datacamp.com\/\" href=\"https:\/\/www.datacamp.com\/\" rel=\"noopener noreferrer\" target=\"_blank\">DataCamp<\/a>&nbsp;and&nbsp;<a data-href=\"https:\/\/www.dataquest.io\/\" href=\"https:\/\/www.dataquest.io\/\" rel=\"noopener noreferrer\" target=\"_blank\">DataQuest<\/a> &mdash; they are both reasonably priced online SaaS data science education products where you learn as you code. They both teach a number of technology tools.<\/li>\n<li id=\"168a\" name=\"168a\"><a data-href=\"https:\/\/www.dataschool.io\/start\/\" href=\"https:\/\/www.dataschool.io\/start\/\" rel=\"noopener noreferrer\" target=\"_blank\">Data School<\/a>&nbsp;has a variety of resources including a nice set of&nbsp;<a data-href=\"https:\/\/www.youtube.com\/dataschool\" href=\"https:\/\/www.youtube.com\/dataschool\" rel=\"noopener noreferrer\" target=\"_blank\">YouTube videos<\/a>&nbsp;explaining data science concepts.<\/li>\n<li id=\"d4d6\" name=\"d4d6\"><a data-href=\"https:\/\/www.amazon.com\/Python-Data-Analysis-Wrangling-IPython\/dp\/1491957662\" href=\"https:\/\/www.amazon.com\/Python-Data-Analysis-Wrangling-IPython\/dp\/1491957662\" rel=\"noopener noreferrer\" target=\"_blank\"><em>Python for Data Analysis<\/em><\/a><em>&nbsp;<\/em>by McKinney. This book by the primary author of the pandas library focusses on pandas and also discusses basic python, numpy, and scikit-learn functionality for data science.<\/li>\n<li id=\"3af9\" name=\"3af9\"><a data-href=\"https:\/\/www.amazon.com\/Introduction-Machine-Learning-Python-Scientists-ebook\/dp\/B01M0LNE8C\" href=\"https:\/\/www.amazon.com\/Introduction-Machine-Learning-Python-Scientists-ebook\/dp\/B01M0LNE8C\" rel=\"noopener noreferrer\" target=\"_blank\"><em>Introduction to Machine Leaning with Python<\/em><\/a><em>&nbsp;<\/em>by M&uuml;ller &amp; Guido. M&uuml;ller is a primary maintainer of scikit-learn. It&rsquo;s an excellent book for learning machine learning with scikit-learn.<\/li>\n<\/ul>\n<p id=\"6434\" name=\"6434\">If you are looking to jump into deep learning, I suggest starting with&nbsp;<a data-href=\"https:\/\/keras.io\/\" href=\"https:\/\/keras.io\/\" rel=\"noopener noreferrer\" target=\"_blank\">Keras<\/a>&nbsp;or&nbsp;<a data-href=\"https:\/\/github.com\/fastai\/fastai\" href=\"https:\/\/github.com\/fastai\/fastai\" rel=\"noopener noreferrer\" target=\"_blank\">FastAI<\/a>&nbsp;before moving on to&nbsp;<a data-href=\"https:\/\/www.tensorflow.org\/\" href=\"https:\/\/www.tensorflow.org\/\" rel=\"noopener noreferrer\" target=\"_blank\">TensorFlow<\/a>&nbsp;or&nbsp;<a data-href=\"https:\/\/pytorch.org\/\" href=\"https:\/\/pytorch.org\/\" rel=\"noopener noreferrer\" target=\"_blank\">PyTorch<\/a>. Chollet&rsquo;s&nbsp;<a data-href=\"https:\/\/www.amazon.com\/Deep-Learning-Python-Francois-Chollet\/dp\/1617294438\" href=\"https:\/\/www.amazon.com\/Deep-Learning-Python-Francois-Chollet\/dp\/1617294438\" rel=\"noopener noreferrer\" target=\"_blank\"><em>Deep Learning with Python<\/em><\/a>&nbsp;is a great resource for learning Keras.<\/p>\n<p id=\"ac92\" name=\"ac92\">Beyond these recommendations, I suggest you learn what interests you, although there are obviously many considerations when deciding how to allocate your learning time.<\/p>\n<figure id=\"25d4\" name=\"25d4\">\n<p><img decoding=\"async\" data-height=\"50\" data-image-id=\"1*sR0SOBlt8njlBt6tG6gwww.png\" data-width=\"185\" src=\"https:\/\/cdn-images-1.medium.com\/max\/640\/1*sR0SOBlt8njlBt6tG6gwww.png\" \/><\/p>\n<\/figure>\n<p id=\"c735\" name=\"c735\">If you&rsquo;re looking for a data scientist job through online portals, I suggest you start with LinkedIn &mdash; it consistently has the most results.<\/p>\n<p id=\"86dd\" name=\"86dd\">If you are looking for a job or posting positions on job sites, keywords matter. &ldquo;<em>data science<\/em>&rdquo; returns nearly 3x the number of results that &ldquo;<em>data scientist<\/em>&rdquo; does on each site. But if you are looking strictly for a data scientist job, you&rsquo;re probably better off searching for &ldquo;<em>data scientist<\/em>&rdquo;.<\/p>\n<p id=\"f52d\" name=\"f52d\">Regardless of where you&rsquo;re looking, I suggest you make an online portfolio that demonstrates your proficiency with as many in-demand skill areas as possible. I also suggest your LinkedIn profile showcase your skills.<\/p>\n<p id=\"f032\" name=\"f032\">As part of this project, I collected other data that I may turn into articles. Follow me to make sure you don&rsquo;t miss out.<\/p>\n<p id=\"f3c3\" name=\"f3c3\">If you want to see the interactive plotly charts and the code behind them, check out my&nbsp;<a data-href=\"https:\/\/www.kaggle.com\/discdiver\/the-most-in-demand-skills-for-data-scientists\/\" href=\"https:\/\/www.kaggle.com\/discdiver\/the-most-in-demand-skills-for-data-scientists\/\" rel=\"noopener noreferrer\" target=\"_blank\">Kaggle Kernel<\/a>.<\/p>\n<p id=\"ea33\" name=\"ea33\">I hope this article has provided you with some insights into what organizations hiring data scientists are looking for. If you learned something, please clap and share on Twitter so others will be more likely to find it<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Data scientists are expected to know a lot &mdash; machine learning, computer science, statistics, mathematics, data visualization, communication, and deep learning. Within those areas there are dozens of languages, frameworks, and technologies data scientists could learn. How should data scientists who want to be in demand by employers spend their learning budget? Which skills are most in demand for data scientists?&nbsp;<\/p>\n","protected":false},"author":369,"featured_media":3212,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"content-type":"","footnotes":""},"categories":[187],"tags":[94],"ppma_author":[2134],"class_list":["post-931","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-bigdata-cloud","tag-data-science"],"authors":[{"term_id":2134,"user_id":369,"is_guest":0,"slug":"jeff-hale","display_name":"Jeff Hale","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/?s=96&d=mm&r=g","user_url":"","last_name":"Hale","first_name":"Jeff","job_title":"","description":"Jeff Hale is a co-founder of Rebel Desk, where he oversees technology, finance, and operations for this company. He&nbsp;is an experienced entrepreneur who has managed technology, operations, and finances for several companies.&nbsp;"}],"_links":{"self":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/931","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/users\/369"}],"replies":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/comments?post=931"}],"version-history":[{"count":1,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/931\/revisions"}],"predecessor-version":[{"id":6256,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/posts\/931\/revisions\/6256"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media\/3212"}],"wp:attachment":[{"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/media?parent=931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/categories?post=931"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/tags?post=931"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.experfy.com\/blog\/wp-json\/wp\/v2\/ppma_author?post=931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}