The Carpentries

The Carpentries is a nonprofit organization that teaches software engineering and data science skills to researchers through instructional workshops.[1][2] The Carpentries is made up of three lesson programs: Software Carpentry, Data Carpentry and Library Carpentry.

The Carpentries
Founder: Greg Wilson
Executive Director: Kari L. Jordan
Website: carpentries.org
Formerly called: Software Carpentry Foundation

The Carpentries workshops have been run internationally, including workshops at the Smithsonian Institution,[3] the Australian Research Data Commons,[4] CERN,[5] and in Antarctica.[6]

History

Software carpenter Greg Wilson

Software Carpentry workshops began in 1998 as week-long training courses taught by Brent Gorda and Greg Wilson at Los Alamos National Laboratory.[7][8][9] The Software Carpentry Foundation was formed in 2014 alongside its sibling foundation, Data Carpentry.[9] These organizations were merged in 2018 to form what is now known as The Carpentries.[2] In the same year, Library Carpentry became the third lesson program of The Carpentries.[1]

Workshops

Carpentries workshops are two-day workshops led by volunteer instructors who have been certified through the organization's training program.[10][11] Content covered in a standard workshop includes using the command line and an introduction to a programming language such as R or Python.[1][12] Workshops under the Data Carpentry program focus on specific subject domains, such as life sciences or social sciences.[10]

A Software Carpentry workshop is designed as an active, collaborative learning experience. The lesson content is hands-on: learners practice by following along as instructors code live, while helpers assist students and help keep the class on pace. Training covers the core skills needed to be productive in a small research team. Tutorials alternate with practical exercises in which learners are encouraged to collaborate, and a shared collaborative document is used to build up the learning process.[13][14]

Lessons

Stable lessons

All lesson content under The Carpentries curriculum is licensed openly under Creative Commons licenses.[1][11]

Before being adopted as an official Carpentries lesson, new lessons go through a series of stages designed to ensure they are sufficiently documented to be teachable by instructors outside of the initial author group.

The Carpentries shares its core lessons, which cover three topics: the Unix shell, version control with Git, and a programming language (Python or R). Curricula for these lessons are available in English and, for select lessons, in Spanish. It also shares Data Carpentry's lessons, which focus on data organization, cleanup, analysis, and visualization.

The Carpentries Community Developed Lessons

There are six stable lessons in total:

  • The Unix Shell:[15] The shell is a power tool that lets people do complex things with just a few keystrokes and automate repetitive tasks. Using the shell is fundamental to working with a wide range of other powerful tools and computing resources.
  • Version Control with Git[16]
  • Programming with Python[17]
  • Plotting and Programming in Python[18]
  • Programming with R[19]
  • R for Reproducible Scientific Analysis[20]

Data Carpentry's lessons

  • Ecology lessons: the Ecology Workshop.[21]
  • Genomics lessons: the Genomics Workshop.[22] The data used in this lesson come from the Lenski experiment. The lesson starts with how to approach a genomics research project, uses the terminal to assess data quality, and continues through to variation analysis.
  • Social science lessons: the Social Science Workshop.[23]
  • Geospatial data lessons: the Geospatial Workshop.[24]

Community developed lessons

The Carpentries community is committed to a collaborative and open process for lesson development and to sharing teaching materials. The Carpentries Incubator[25] contains lessons developed by community members. These lessons follow a life cycle of four stages: pre-alpha, alpha, beta, and stable. The cycle begins with pre-alpha, where only the concept is offered; by beta, the lesson is taught in a workshop by instructors other than the authors.

Pre-alpha is the draft of the initial lesson idea. The goal of alpha is to collect and incorporate feedback from learners and co-instructors. The two lessons in the beta stage are Reproducible Computational Environments using Containers[26] and Data Harvesting for Agriculture.[27]
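The ordering of the four stages can be pictured as a simple ordered enumeration. The following is only an illustrative sketch: the names `LessonStage` and `next_stage` are hypothetical and are not part of any Carpentries tooling, and the one-step-at-a-time progression is an assumption made for the example.

```python
from enum import IntEnum


class LessonStage(IntEnum):
    """Life-cycle stages named in the text, in order (hypothetical model)."""
    PRE_ALPHA = 0  # draft of the initial lesson idea
    ALPHA = 1      # feedback from learners and co-instructors is collected
    BETA = 2       # taught by instructors other than the authors
    STABLE = 3     # final stage of the life cycle


def next_stage(stage: LessonStage) -> LessonStage:
    """Return the following stage; a stable lesson stays stable."""
    if stage is LessonStage.STABLE:
        return stage
    return LessonStage(stage + 1)


# Walk a hypothetical lesson through the whole life cycle.
stage = LessonStage.PRE_ALPHA
path = [stage.name]
while stage is not LessonStage.STABLE:
    stage = next_stage(stage)
    path.append(stage.name)
print(" -> ".join(path))
```

Using an ordered enumeration means stages can be compared directly, for example `LessonStage.ALPHA < LessonStage.BETA`.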

The Carpentries Incubator has approximately 30 lessons available in the alpha stage, ranging from From a Spreadsheet to a Database[28] to Python for Humanities[29] and Metagenomics.[30] The other main way for community members to share lesson material is The CarpentriesLab,[25] a repository for high-quality, peer-reviewed, short-format lessons that use the teaching approach and lesson design from The Carpentries. It is also possible to get peer review on the content of a lesson by submitting it to the Incubator through The Carpentries.[31]

The lessons from both the Carpentries Incubator and CarpentriesLab can be taught in meetups, in classes, or as complements to a standard two-day Carpentries workshop. Independent learners who do not attend a workshop can also benefit from the lessons.

Other language lessons

The Carpentries community has developed Spanish versions of its core lessons, which cover the Unix shell, version control with Git, and R as a programming language. As of 2021, the stable lessons available in Spanish were:

  • La Terminal de Unix[32]
  • El Control de Versiones con Git[33]
  • R para Análisis Científicos Reproducibles[34]

Funding

The Carpentries is fiscally sponsored by Community Initiatives[35] and funded through a combination of memberships, workshop fees, grants and donations. The Carpentries has over 70 member organizations,[36] including the Software Sustainability Institute,[37] the National Institute of Standards and Technology,[38] New Zealand eScience Infrastructure,[39] and Compute Canada.[40]

In November 2017, the Library Carpentry program received a supplemental Institute of Museum and Library Services grant, in partnership with the California Digital Library, valued at $249,553.[41][42]

In November 2019, the Chan Zuckerberg Initiative and the Gordon and Betty Moore Foundation announced a joint award of $2.65 million for The Carpentries.[43]

References

  1. Pugachev, Sarah (2019). "What Are "The Carpentries" and What Are They Doing in the Library?". Portal: Libraries and the Academy. 19 (2): 209–214. doi:10.1353/pla.2019.0011. ISSN 1530-7131. S2CID 146034351.
  2. Atwood, Thea P; Creamer, Andrew T.; Dull, Joshua; Goldman, Julie; Lee, Kristin; Leligdon, Lora C.; Oelker, Sarah K (2019). "Joining Together to Build More: The New England Software Carpentry Library Consortium". Journal of EScience Librarianship. 8 (1): e1161. doi:10.7191/jeslib.2019.1161.
  3. "Carpentries, Genomics, and Data Science training at the Smithsonian | Smithsonian Data Science Lab". datascience.si.edu. Retrieved 2019-11-10.
  4. "Supporting The Carpentries". ARDC. Retrieved 2019-11-10.
  5. "Software Carpentry at CERN (27-29 November 2019): Overview · Indico". Indico. Retrieved 2019-11-10.
  6. Perkel, Jeffrey M. (2018). "Software training in Antarctica". Nature. 560 (7719): 515. Bibcode:2018Natur.560..515P. doi:10.1038/d41586-018-06011-1. PMID 30127483. S2CID 52048713.
  7. Markel, Scott; Devenyi, Gabriel A.; Emonet, Rémi; Harris, Rayna M.; Hertweck, Kate L.; Irving, Damien; Milligan, Ian; Wilson, Greg (2018). "Ten simple rules for collaborative lesson development". PLOS Computational Biology. 14 (3): e1005963. arXiv:1707.02662. Bibcode:2018PLSCB..14E5963D. doi:10.1371/journal.pcbi.1005963. ISSN 1553-7358. PMC 5832188. PMID 29494585.
  8. Wilson, Gregory (2021). "The Third Bit". third-bit.com. “Start where you are, use what you have, help who you can”
  9. Wilson, Greg (2016). "Software Carpentry: lessons learned". F1000Research. 3: 62. doi:10.12688/f1000research.3-62.v2. ISSN 2046-1402. PMC 3976103. PMID 24715981.
  10. Pawlik, Aleksandra; van Gelder, Celia W.G.; Nenadic, Aleksandra; Palagi, Patricia M.; Korpelainen, Eija; Lijnzaad, Philip; Marek, Diana; Sansone, Susanna-Assunta; Hancock, John; Goble, Carole (2017). "Developing a strategy for computational lab skills training through Software and Data Carpentry: Experiences from the ELIXIR Pilot action". F1000Research. 6: 1040. doi:10.12688/f1000research.11718.1. ISSN 2046-1402. PMC 5516217. PMID 28781745.
  11. Labou, Stephanie; Otsuji, Reid (2019). "Expanding Library Resources for Data and Compute-Intensive Education and Research". 2019 15th International Conference on EScience (EScience). San Diego, CA, USA: IEEE: 646–647. doi:10.1109/eScience.2019.00100. ISBN 978-1-7281-2451-3. S2CID 214594737.
  12. National Academies Of Sciences, Engineering; Division of Behavioral Social Sciences Education; Board On Science, Education; Division on Engineering Physical Sciences; Committee on Applied Theoretical Statistics; Board on Mathematical Sciences Analytics; Computer Science Telecommunications Board; Committee on Envisioning the Data Science Discipline: The Undergraduate Perspective (2018). Data Science for Undergraduates: Opportunities and Options. Washington, DC: The National Academies Press. p. 55. doi:10.17226/25104. ISBN 978-0-309-47559-4. PMID 30407778. S2CID 86392049.
  13. Weaver, Belinda (2020). The efficacy and usefulness of software carpentry training: a follow-up cohort study (PDF) (master). Retrieved 2021-01-01.
  14. "Instructor Training". Retrieved 2021-07-02.
  15. "The Unix Shell". swcarpentry.github.io.
  16. "Version Control with Git". swcarpentry.github.io.
  17. "Programming with Python". swcarpentry.github.io.
  18. "Plotting and Programming in Python". swcarpentry.github.io.
  19. "Programming with R". swcarpentry.github.io.
  20. "R for Reproducible Scientific Analysis". swcarpentry.github.io.
  21. "Ecology Workshop Overview". datacarpentry.org.
  22. "Genomics Workshop Overview". datacarpentry.org.
  23. "Social Science Workshop Overview". datacarpentry.org.
  24. "Geospatial Workshop Overview". datacarpentry.org.
  25. "Community Developed Lessons". The Carpentries.
  26. "Reproducible Computational Environments Using Containers: Introduction to Docker". carpentries-incubator.github.io.
  27. "Data Harvesting for Agriculture". carpentries-incubator.github.io.
  28. "From a Spreadsheet to a Database". carpentries-incubator.github.io.
  29. "Python for Humanities". carpentries-incubator.github.io.
  30. "Data processing and visualization for metagenomics". carpentries-incubator.github.io.
  31. "GitHub Repository". github.com. 9 November 2021.
  32. "La Terminal de Unix". swcarpentry.github.io.
  33. "El Control de Versiones con Git". swcarpentry.github.io.
  34. "R para Análisis Científicos Reproducibles". swcarpentry.github.io.
  35. "Fiscally Sponsored Projects". Community Initiatives. Retrieved 2019-11-10.
  36. "4TU.ResearchData | Expanding Researchers' software skills at Technical Universities across The Netherlands". researchdata.4tu.nl. Retrieved 2020-07-07.
  37. "The Carpentries and our partnership | Software Sustainability Institute". software.ac.uk. Retrieved 2019-11-11.
  38. Greene, Gretchen (2019-07-02). "Software and Data Carpentry". NIST. Retrieved 2019-11-11.
  39. "NeSI partners with Software Carpentry to expand research computing training". New Zealand eScience Infrastructure. Retrieved 2020-07-07.
  40. "Training | Compute Canada". 3 April 2015. Retrieved 2019-11-11.
  41. "Library Carpentry Receives Supplemental IMLS Grant – UC3 :: California Digital Library". Retrieved 2020-01-13.
  42. "RE-85-17-0121-17". Institute of Museum and Library Services. 2017-08-30. Retrieved 2020-01-13.
  43. "$2.65 million to expand computational research skills in science". Scienceboard.net. Retrieved 2019-11-11.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.