All you need to know on Correspondence Analysis …
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Correspondence Analysis – CA – is an exploratory multivariate method for exploring and visualizing contingency tables, i.e. tables on which a chi-squared test can be performed. CA is particularly useful in text mining.
The function Factoshiny of the package Factoshiny allows you to perform CA in an easy way. You can include extras information, manage missing data, draw and improve the graph interactively, have several numeric indicators as outputs, perform clustering on the CA results, and even have an automatic interpretation of the results. Finally, the function returns the lines of code to parameterize the analysis and redo the graphs, which makes the analysis reproducible.
Implementation with R software
See this video and the audio transcription of this video:
The lines of code to do a Correspondence Analysis:
install.packages(Factoshiny) library(Factoshiny) data(children) result <- Factoshiny(children)
Course videos
Theorectical and practical informations on Correspondence Analysis are available in these 6 course videos:
- Introduction
- Visualizing the row and column clouds
- Inertia and percentage of inertia
- Simultaneous representation
- Interpretation aids
- Text mining with correspondence analysis
Here are the slides and the audio transcription of the course.
Materials
Here is the material used in the videos:
- Nobel data: the data set – Rmarkdown – the script with the outputs
- Birth data: the data set – Rmarkdown – the script with the outputs
Follow this link if you want to see more methods on Exploratory Data Analysis.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.