Predicting Categorical and Continuous Outcomes Using Rule of Ten (CARRoT)
Predicts categorical or continuous outcomes while concentrating on four key points. These are Cross-validation, Accuracy, Regression and Rule of Ten (CARRoT). It performs the cross-validation specified number of times by partitioning the input into training and test set and fitting linear/multinomial/binary regression models to the training set. All regression models satisfying a rule of ten events per variable are fitted and the ones with the best predictive power are given as an output. Best predictive power is understood as highest accuracy in case of binary/multinomial outcomes, smallest absolute and relative errors in case of continuous outcomes. For binary case there is also an option of finding a regression model which gives the highest AUROC (Area Under Recever Operating Curve) value. The option of parallel toolbox is also available. Methods are described in Peduzzi et al. (1996) <doi:10.1016/S0895-4356(96)00236-3> and Rhemtulla et al. (2012) <doi:10.1037/a0029315>.

Detect Small Changes in Process Mean using CUSUM Control Chart by v-Mask (vMask)
The cumulative sum (CUSUM) control chart is considered to be an alternative or complementary to Shewhart control charts in statistical process control (SPC) applications, owing to its higher sensitivity to small shifts in the process mean. It utilizes all the available data rather than the last few ones used in Shewhart control charts for quick decision making. V-mask is a traditional technique for separating meaningful data from unusual circumstances in a Cumulative Sum (CUSUM) control chart; for see details about v-mask see Montgomery (1985, ISBN:978-0471656319). The mask is a V-shaped overlay placed on the CUSUM chart so that one arm of the V lines up with the slope of data points, making it easy to see data points that lie outside the slope and to determine whether these points should be discarded as random events, or treated as a performance trend that should be addressed. But, complex computations is one disadvantage V-mask method for detect small changes in mean using CUSUM control chart. Package ‘vMask’ can help to the applied users to overcome this challenge by considering six different methods which each of them are based on different information.

Visualizing Panel Data with Dichotomous Treatments (panelView)
Visualizes panel data with dichotomous treatments. ‘panelView’ has two main functionalities: (1) it visualizes the treatment and missing-value statuses of each observation in a panel/time-series-cross-sectional (TSCS) dataset; and (2) it plots the outcome variable (either continuous or discrete) in a time-series fashion.

Tools for Data Diagnosis, Exploration, Transformation (dlookr)
A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values and outliers and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and relationship between target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputates missing values and outliers, resolving skewness. And it creates automated reports that support these three tasks.

Fitting Joint Mean and Dispersion Effects Models (jmdem)
Joint mean and dispersion effects models fit the mean and dispersion parameters of a response variable by two separate linear models, the mean and dispersion submodels, simultaneously. It also allows the users to choose either the deviance or the Pearson residuals as the response variable of the dispersion submodel. Furthermore, the package provides the possibility to nest the submodels in one another, if one of the parameters has significant explanatory power on the other. Wu & Li (2016) <doi:10.1016/j.csda.2016.04.015>.

Advertisements