Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data,and apply knowledge and actionable insights from data across a broad range of application domains. Data science is said to data processing , machine learning and large data.
Data science may be a “concept to unify statistics, data analysis, informatics, and their related methods” so as to “understand and analyze actual phenomena” with data. It uses techniques and theories drawn from many fields within the context of mathematics, statistics, computing , informatics , and domain knowledge. However, data science is different from computing and knowledge science. Turing Award winner Jim Gray imagined data science as a “fourth paradigm” of science (empirical, theoretical, computational, and now data-driven) and asserted that “everything about science is changing due to the impact of data technology” and therefore the data deluge
There are a spread of various technologies and techniques that are used for data science which depend upon the appliance . More recently, full-featured, end-to-end platforms are developed and heavily used for data science and machine learning
Data analysts bridge the gap between data scientists and business analysts. they’re given the questions that require answering from a corporation then organize and analyze data to seek out results that align with high-level business strategy. Data analysts are liable for translating technical analysis to qualitative action items and effectively communicating their findings to diverse stakeholders.
Skills needed: Programming skills (SAS, R, Python), statistical and mathematical skills, data wrangling, data visualization
Data engineers manage exponential amounts of rapidly changing data. They specialise in the event , deployment, management, and optimization of knowledge pipelines and infrastructure to rework and transfer data to data scientists for querying.
Skills needed: Programming languages (Java, Scala), NoSQL databases (MongoDB, Cassandra DB), frameworks (Apache Hadoop)