-
-
Notifications
You must be signed in to change notification settings - Fork 28
Zoom Meeting Agenda (2021 03 15)
Clarke Iakovakis edited this page Mar 15, 2021
·
9 revisions
Goal: Reach for consensus on creating a plan draft for the next round of edits to the lesson
- Discuss Agenda. Modify as necessary.
- Announcements and Comments
- Review Feedback
- Email from Sarah Lin (2/23/2021 - see below)
- Before we start
- Starting with Data
- Data cleaning and transformation
- Data viz with ggplot2
- Functions and details
- Slack from Saranya Canchi - https://swcarpentry.slack.com/archives/C01CNV5F15H/p1612203060000800
- Slack from Adrienne Cannino - https://swcarpentry.slack.com/archives/C01CNV5F15H/p1612216539009500
- Email from Sarah Lin (2/23/2021 - see below)
- Review Issues - https://github.com/LibraryCarpentry/lc-r/issues
- numerous
- Prioritize Next Steps
- Subgroups
- Next Meeting(s) and meeting frequency
March 15, 2021 Present: Clarke, John Little, Sarah Lin, Dough Joubert, Annajiat Alim A list of maintainers is found in the readme
- Clarke] comments on fundamentally restructuring the lesson and leading with Tidyverse.
- Clarke] Links are broken and exercises that talk about vectors that are not actually created
- Doug] confirmed that starting with the Tidyverse is a good approach.
- Clarke] review the issues scheme and give people permission to assign themselves as an assignee
- Annajiat] suggests we encourage issue-submitters to perform their own changes and submit as a PR. As a reply to the issues, "would you be interested in submitting a PR?"
- Clarke] will give a first pass to all the issue.
- Sarah] Agrees we should suggest that people make issues as PRs
- Doug] Reviews the process as an end-users/submitter. top-right corner (improve this page). end-users types
- Clarke] -- suggest "starting with data" is the great place
- Sarah] -- read in a CSV, what are the common error message (what if you have spaces in names, need to change columns). Don't get too deep in character vectors.
- All] -- tips, tricks, and shared experiences in teaching
- janitor::clean_names()
- example: leading0 issue with ISBNs
- data vectors and data types is found in the _introduction_
- import data wizard is useful for empowering our users
- making R a "tear-free experience"
- Promoting the gratifying experience of having a lot of modules in the advanced section
- for **NEXT Time**
- look closely at [before we start](https://librarycarpentry.org/lc-r/00-before-we-start/index.html)
- Clarke] will make the site up to date and (push changes and rebuild). Clarke will notify us.
- Before we start (all)
- Starting with Data
- What is a data.frame?
- How can I read a complete csv file into R?
- How can I get basic summary information about my dataset?
- Data cleaning & transformation
- How can I select specific rows and/or columns from a data frame?
- How can I combine multiple commands into a single command?
- How can I create new columns or remove existing columns from a data frame?
- Data Visualization with ggplot2 (all)
- Functions (or a much better name) What is a function and how can we pass arguments to functions?
- How can values be initially assigned to variables of different data types?
- How can a vector be created?
- What are the available data types?
- How can subsets be extracted from vectors?
- How does R treat missing values?
- How can we deal with missing values in R?
- How can I change the way R treats strings in my dataset?
- Why would I want strings to be treated differently?
- How are dates represented in R and how can I change the format?
Caveats:
- My goal with this was the idea that with sections 1-4, a person could leave actually doing something simple in R that was useful in the real world, and if they got section 5 afterward, they'd be able to handle some fairly common problems/use cases (like dates). The first 4 sections would be pretty fast, I think, with 5 possibly just as long as the first 4 in total.
- I didn't check the existing examples and assume that a lesson like this would need great care in making sure the code & examples didn't surface any of the tasks that come up in section 5 and that datasets would be curated to bring up on the commands/ideas being taught
- I think things like strings, vectors, and subsets are more easily understood if you're already mastered a couple of actions and the need naturally arises to do more with the data, rather than front-loading tasks that are eventually useful.