By David Smith
It’s been over a year since we first introduced introduced the Team Data Science Process (TDSP). The data, technology and practices behind Data Science continue to evolve, and the TDSP has evolved in parallel. Over the past year, several new facets have been added, including:
- The IDEAR (Interactive Data Exploration, Analysis and Reporting) framework, an open source extension to R and Python designed to standardize the process of data exploration and reporting;
- Guidance for use of Spark 2.0, including an end-to-end Spark v2.0 walkthrough;
- Guidance for use of in-database Python with SQL Server, including an end-to-end in-database Python tutorial;
- Instantiation of TDSP projects and templates within the new Azure Machine Workbench.
For an example of applying the TDSP to effective data science projects, check out Buck Woody’s 10-part series walking through every stage of a typical data science project.
Cortana Intelligence and Machine Learning Blog: The Microsoft Team Data Science Process (TDSP) – Recent Updates
R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more…
Source:: R News