DEV Community

Cover image for Popular Data Science Tools used by Data Scientists 🧠
amananandrai
amananandrai

Posted on • Edited on

Popular Data Science Tools used by Data Scientists 🧠

Data science according to Wikipedia is defined as

"Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from many structural and unstructured data. Data science is related to data mining, machine learning and big data."

Alt Text



Many people have this notion that Data Science just means training Machine Learning models but it is an amalgamation of various fields. A person must have knowledge of statistics, cloud technologies, coding and databases to become a data scientist. With the increasing change in technology knowledge of DevOps for Machine Learning(known as MLOps) and AutoML is also necessary.

Kaggle the largest community of Data Scientists conducted a survey and based on that survey I am presenting the list of different tools and technologies widely used in different domains of Data Science. Feel free to add some other tools you know which are popular and not mentioned in this list.

Machine Learning Frameworks

Alt Text



Machine Learning is one of the core technologies associated with Data Science. Python and R are the widely used languages for ML. The most popular frameworks are based on Python namely scikit-learn, Tensorflow and PyTorch.

Enterprise Machine Learning Tools

Alt Text


Most of the beginners don't know about the tools for ML on cloud, these are some of the best enterprise ML tools by software giants like Google, Amazon and Azure.

Business Intelligence Tools

Alt Text


Business Intelligence means analyzing data of companies and giving reports and predicting sales and markets. It is one of the most popular use cases of datascience and deals mainly with Statistics and Data Visualization. Some of the most popular Business Intelligence tools used in the industry are given below.

Databases Used

Alt Text


Databases form an important part of DataScience because without data there is no datascience. The different databases used by data scientists all over the world is as follow.

Automated Machine Learning (AutoML)

Alt Text


AutoML is one of the most promising technology of modern era which is growing at an alarming rate. Some of the famous AutoML tools are-

Cover Image Cartoon vector created by vectorjuice - www.freepik.com

Top comments (1)

Collapse
 
dcsan profile image
dc

please add some links to your articles...