Welcome to the course! I am looking forward to getting to know you this semester. To get started, please familiarize yourself with this syllabus and our website. I developed this course to provide a welcoming environment and effective learning experience for all students. If you encounter barriers in this course, please bring them to my attention so that I may work to address them, and reach out to me at any time if you have questions about course content or assignments.
This is a face to face course that will meet two times a week. Our class sessions include important information and opportunities to apply what you are learning with your peers. To best achieve the learning outcomes for the course, it is important that you attend class and be engaged with the content, me and each other. You should expect to spend approximately 9 hours each week on work for this course (including in and out of class time).
The scientific community widely acknowledges that we are in the midst of a reproducibility crisis (e.g. Baker, 2016). This course begins by reviewing the evidence for, and causes of, this crisis, and aims to highlight key factors that can improve reproducibility in science—especially within Ecology, Evolution & Behavior. It also provides a platform to develop, practice, and strengthen science communication skills (e.g. Soltis et al., 2023 for some innovative ways to share your research).
The overarching aim of this course is to equip students with the theoretical knowledge and bioinformatics tools necessary to enhance transparency, reproducibility, and efficiency in scientific research. Throughout the course, students will learn to use open-source software for research, including R, RStudio and R Markdown (incl. knitr). To further develop these skills—and improve communication and teaching competencies—students will
Overall, this course provides students with key knowledge to gather, store, share, prepare, and analyze data, as well as communicate results to the scientific community and various stakeholders (see Figure 5.1).
Figure 5.1: Overview of workflow studied in PART2.
The course is subdivided into three parts:
Part 1 provides students with key theoretical knowledge on reproducible science, enabling them to design and implement a reproducible approach tailored to Ecology, Evolution & Behavior. This section also covers topics related to open science and data management, and how these practices intersect with scientific publishing.
Part 2 offers students the opportunity to further develop and apply coding and bioinformatics tools essential for building reproducible workflows (Figure 5.1). In this section, students—supported by the instructor—will develop and teach a tutorial on a specific bioinformatics topic over two full classes. Tutorials will be written in RMarkdown and distributed to the class one week in advance. Depending on class size, this assignment may be completed individually or in groups.
Part 3 is designed to give students the chance to develop an individual reproducible workflow tailored to their own research project (or using data from a publication, if they have not yet defined a thesis topic). This assignment builds on knowledge gained in the earlier parts of the course and will involve collaboration with the instructor—and in some cases, with thesis advisors.
PART 1: The Big Picture
PART 2: Bioinformatics for Reproducible Science
PART 3: Apply a Reproducible Approach to your Data
Learning outcomes for each chapter are available on the course website.
The tentative schedule for this course is available here. The instructor wants to warn students that he might adjust the schedule to accommodate any needs. However, in case of changes, the instructor will make sure to contact enrolled students to keep them posted.
A shared Google Drive has been set up for data sharing and uploading bioinformatic tutorials.
The reading material at the basis of this course is composed of a mixture of publications and chapters mostly from two textbooks (Gandrud, 2015; Wickham and Grolemund, 2017). We will also study the “Guides to” published by the British Ecological Society. Please find below the references used in each chapter. This list is not exhaustive and additional literature will be provided in class.
Chapter | Reference(s) |
---|---|
Chap. 1 | Chapter 3 of Gandrud (2015) |
Chap. 2 | Baker (2016); Freedman et al. (2015); Munafo et al. (2017); Peng and Hicks (2021); Sarewitz (2016) |
Chap. 3 | Bone et al. (2015); Markowetz (2015); Smith et al. (2016) |
Chap. 4 | Carroll et al. (2021); Creative Commons; Wagner et al. (2022); Williams et al. (2023) |
Chap. 5 | British Ecological Society (2014a) & Chapter 4 of Gandrud (2015); British Ecological Society (2014d) & Chapter 2 of Gandrud (2015); Trisovic et al. (2022) |
Chap. 6 | British Ecological Society (2014b); British Ecological Society (2014c) |
Chap. 7 | Chapter 6 of Gandrud (2015) |
Chap. 8 | Chapter 7 of Gandrud (2015) & Chapters 9-10 of Wickham and Grolemund (2017) |
Chap. 9 | Chapter 8 of Gandrud (2015) |
Chap. 10 | Chapter 9 of Gandrud (2015) |
Chap. 11 | Chapter 10 of Gandrud (2015), Chapters 1 and 22 of Wickham and Grolemund (2017) & Guangchuang et al. (2017) |
Chap. 12 | Chapter 5 of Gandrud (2015) |
To further illustrate how a reproducible approach can be applied to research, the instructor provides examples of publications and software produced by students who have taken this course (listed alphabetically):
Research is often presented in the form of slideshows, articles or books. These presentation documents announce a project’s findings, but they are not the research, they are the advertisement part of the research project!
The research is the full software environment, code, and data that produced the results (Donoho, 2010).
When we separate the research from its advertisement, we are making it difficult for others to verify the findings by reproducing them.
This course will equip you with tools to integrate your research with clear and reproducible presentation of your findings. The first is a reproducible research workflow, which applies the principles of reproducibility throughout your entire project—from data collection to statistical analysis and results presentation. To support this, you will learn to use a range of computing tools that make this workflow possible.
The main bioinformatics tools covered in this course are:
As shown above, R and RStudio are at the core of this course and will have to be installed on your computers. This can be easily done by downloading the software from the following websites:
The download pages for these software tools include detailed installation instructions; please refer to them for more information.
If you are planning to create LaTeX documents, you will need to install a Tex distribution. Please refer to this website for more details: https://www.latex-project.org/get/
If you want to create Markdown documents you can separately install the rmarkdown package in R (see below for more details).
We will be using several R packages specifically designed to support reproducible research. Many of these packages are not included in the default R installation and must be installed separately.
To install the core packages used in this course, copy the following code and paste it into your R console:
install.packages(c("brew", "countrycode", "devtools", "dplyr", "ggplot2", "googleVis",
"knitr", "rmarkdown", "tidyr", "xtable"))
Once you run this code, you may be prompted to select a CRAN “mirror” to download the packages from. Simply choose the mirror closest to your location.
It is also likely that we will need to install additional packages throughout the course. When that happens, you can install them using the same R function install.packages()
, or through RStudio by selecting “Tools” → “Install Packages…”, entering the package name in the dialog box, and ensuring the “Install dependencies” option is checked.
RStudio offers a collection of cheat sheets accessible from the “Help” menu by selecting “Cheatsheets.”
Five cheat sheets are especially relevant to chapters taught in this course:
These documents together with the material presented in course resources will provide the basis to design your bioinformatics tutorials.
Please find below two documents providing a comprehensive introduction to R:
There will not be any classical exams in this course, but we will rather focus on developing theoretical and bioinformatics skills and applying those to your research. In this context, each student will be asked to produce a bioinformatics tutorial and teach it to their peers (see Course Content - PART 2). Each student will also be tasked to produce a report (tailored to their thesis project or a publication) and present their results and conclusions in class.
Students will be graded on the following four mandatory assessments:
These assessments total 550 points. Table 10.1 shows the grading scale used in this course.
The grading scale for this course is in Table 10.1.
Percentage | Grade |
---|---|
100-98 | A+ |
97.9-93 | A |
92.9-90 | A- |
89.9-88 | B+ |
87.9-83 | B |
82.9-80 | B- |
79.9-78 | C+ |
77.9-73 | C |
72.9-70 | C- |
69.9-68 | D+ |
67.9-60 | D |
59.9-0 | F |
You can look at all of your scores by accessing Grades in the Canvas course menu.
During the first two weeks, students will sign up for a chapter from Course Content - PART 2 to study and create a bioinformatics tutorial. Depending on enrollment, tutorials will be developed individually or in groups, with support from the instructor (see details below).
Tutorials must be written using knitr/rmarkdown in RStudio and focus on a set of exercises designed to build key bioinformatics skills related to the chosen chapter (see Course Content - PART 2). Students are encouraged to use materials from Gandrud (2015) and Wickham and Grolemund (2017), or other properly cited sources. For additional resources, see the Publications and Textbooks and RStudio Cheat Sheets sections.
Each tutorial should be designed to fit within two laboratory sessions (see Teaching Tutorials). Tutorials must be submitted to the instructor one week before presentation for review and uploading to the shared Google Drive.
When designing your tutorials, please consider the following:
When explaining each concept, clearly describe the rationale, outline the steps (using pseudocode if helpful), and emphasize the input/output and data type or format for each function.
Based on the information provided above, your tutorial should include:
Students are expected to prepare a 10–20 minute presentation that provides general guidance on completing their tutorial. Presentations will be uploaded to the shared Google Drive and made accessible to all students. During the tutorial sessions, students must run through their tutorial with their peers, ensuring they understand the key concepts and are able to complete the assignments or tasks. Students are expected to support their peers by answering questions throughout. While the instructor will also be available to assist, students will take the lead in teaching the bioinformatics laboratories.
Grading will be based on students’ ability to effectively teach their tutorials, facilitate understanding, and respond to questions. The instructor may also consider peer feedback when assigning grades for this component.
Students will collaborate with the instructor to develop a reproducible workflow tailored to their thesis project. For students who do not yet have a defined thesis topic, the instructor will help select a relevant publication to serve as the basis for their individual project. The instructor encourages students to reach out as soon as possible to begin designing their individual projects.
Reports should be written using knitr/rmarkdown in RStudio. Students must clearly state the rationale, objectives, and the scientific question at the core of their report. Framing the project within the scientific process is essential, as it is impossible to evaluate reproducibility without this context.
Additionally, students are expected to include a list of references supporting their report. References must be properly cited within the text; simply listing them at the end is not sufficient. This practice helps justify methodological choices and enhances transparency.
When considering your individual project, keep the following questions in mind:
Each student will have to present their report during final week. The presentation should follow the same structure as the report and not exceed 15 minutes. There will be 5 minutes at the end of the presentation allocated for questions.
My goal is that every student is successful in this course, but I need your help to achieve that. In order to do your part to ensure your success in this course, please:
If you are unable to attend class, please contact the instructor as soon as possible via email at svenbuerki@boisestate.edu.
In support of my goal that every student be successful in this course, you can expect that I:
Students in this class represent a rich variety of backgrounds and perspectives. The (program/dept) is committed to providing an environment where similarities and differences are respected, supported, and valued. While working together to build this community, we ask all members to:
As your instructor, my goal is to make sure that our learning environment is effective for everyone. This means, in part, that each student is encouraged to share perspectives relevant to the course material and that our class activities and discussions are conducted in a way that supports everyone’s learning.
If you are struggling for any reason (e.g., family emergency, financial/basic needs, mental/physical health concerns, caregiving responsibilities, etc.) and believe these struggles may impact your performance in the course, I encourage you to reach out to me if you are comfortable doing so, and I will refer you to an appropriate university resource. You may also reach out directly to the outreach team in the Office of the Dean of Students at (208) 426-1527 or email studentoutreach@boisestate.edu for support. The Student Life Essentials page is also a great place to find helpful resources. If you notice a significant change in your mood, sleep, feelings of hopelessness or a lack of self worth, consider connecting immediately with Counseling Services (1529 Belmont Street, Norco Building) at (208) 426-1459 or email healthservices@boisestate.edu.
The university has many resources designed to support you as a learner and human being. Among these are:
Academic Excellence is a Shared Value at Boise State, and part of your responsibility in pursuing academic excellence includes avoiding cheating, plagiarism, and any other kind of academic misconduct. If I find a student responsible for academic misconduct in our class, the outcome of their choice to not fully engage in their learning might range from a ‘revise & resubmit’ up to an ‘F (failure) for the course.’ For more info, please read The Student Code of Conduct (Policy 2020), Section 7: Academic Misconduct Complaints, Violations, Processes and Sanctions.
In this course, I want to see your thoughts, understand your reasoning, and hear your voice. However, there are moments in this course where you might find it useful to use generative AI tools in support of your learning.
You may use generative AI tools for specified activities and assignments if their use supports, rather than undermines, your learning. While generative AI can help to advance your learning, its usefulness depends on the purpose of each activity or assignment. You will find guidelines for generative AI use in the instructions for each assignment; please read them very carefully, as these guidelines differ by assignment.
If you use ChatGPT, Gemini, Grammarly, Midjourney, or other AI tools in support of your work in this course, cite any ideas, text, images, or other media generated by the tool using the instructions and format of the Modern Language Association (MLA), American Psychological Association (APA), Chicago Manual of Style, or other citation style as appropriate. When you use a tool in an assignment, include a brief, clear description of how you used it. If you use generative AI, you must not let this tool replace your thinking and work. In fact, it is your responsibility to ensure you are fully engaging in learning and submitting authentic work. To learn more about how to learn successfully and avoid academic misconduct behaviors, please review the Student Code of Conduct with special attention to Section 8: Procedures for Academic Misconduct.
If you are unsure of whether or when to use generative AI tools in this course, please reach out to me. I’m eager to learn about how we might use them in new ways to meaningfully advance your learning and prepare you for your future beyond Boise State.
Boise State has a Communicable Disease Policy (Policy 9270) that guides everyone working and learning in our community. The policy has two key implications:
Due dates for every assignment are provided on the course schedule. Unless otherwise stated, assignments are due on those days. However, I recognize that things can happen that are out of your control. In these instances, please reach out to me ahead of the due date, or as soon as possible afterwards, so that we can make a plan for you to submit the assignment as close to the original due date as possible. If an assignment is not submitted within two weeks of the due date, and you have not contacted me about when you will be submitting it, I will enter the assignment as Incomplete in Canvas until such time that we discuss the missing assignment and agree on a course of action.
Under Idaho law (Section § 67-5909D), some university courses with content related to diversity, equity, inclusion, or critical theory may be subject to certain restrictions. However, the law affirms and does not limit free discussion in the learning environment. Like all Boise State courses, this course supports open inquiry, intellectual honesty, and respectful engagement with a range of perspectives, all of which are consistent with student rights and responsibilities described in the Student Code of Conduct (Policy 2020).
This course may include content that touches on concepts related to diversity, equity, inclusion (DEI), or critical theory—such as systemic inequality, cultural identity, or gender and race in society. If these topics are included, it is because they are relevant to the learning outcomes for this course and are explored to support critical thinking, deeper understanding, and respectful engagement with different perspectives. As part of the course, you may be asked to apply or explain ideas that come from a particular perspective. However, you are not required to adopt such perspectives as your own.
Our learning environment is a space for open dialogue and thoughtful discussion, including complex or challenging topics. Everyone is expected to engage with curiosity, listen respectfully, and contribute in ways that support a productive and welcoming learning environment. Boise State and the Idaho State Board of Education affirm the importance of free expression and academic inquiry. As outlined in SBOE Policy III.B:
“Membership in the academic community imposes on administrators, faculty members, other institutional employees, and students an obligation to respect the dignity of others, to acknowledge the right of others to express differing opinions, and to foster and defend intellectual honesty, freedom of inquiry and instruction, and free expression on and off the campus of an institution.”
Disruptive behavior that interferes with the learning environment will not be tolerated and may result in removal from this course, in line with university policy (See Policy 3240 Maintaining Effective Learning Environments).
In this course, I will foster critical discussion and analysis, and a respectful consideration of a wide range of ideas, in accordance with the Faculty Code of Rights, Responsibilities, and Conduct (Policy 4000). You are encouraged to think critically, question ideas, and form your own conclusions. As always, you have the freedom to choose courses that align with your academic goals—if you have concerns about course content, please talk with your instructor or advisor. Refer to the academic calendar for important deadlines related to course withdrawal.
To learn more about the law and its impact at Boise State, visit the Provost Office’s Information Regarding Section 67-5909D page.
Citations of all R packages used to generate this report.
[1] J. Allaire, Y. Xie, C. Dervieux, et al. rmarkdown: Dynamic Documents for R. R package version 2.25. 2023. https://github.com/rstudio/rmarkdown.
[2] C. Boettiger. knitcitations: Citations for Knitr Markdown Files. R package version 1.0.12. 2021. https://github.com/cboettig/knitcitations.
[3] M. C. Koohafkan. kfigr: Integrated Code Chunk Anchoring and Referencing for R Markdown Documents. R package version 1.2.1. 2021. https://github.com/mkoohafkan/kfigr.
[4] R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Vienna, Austria, 2022. https://www.R-project.org/.
[5] H. Wickham, J. Bryan, M. Barrett, et al. usethis: Automate Package and Project Setup. R package version 2.2.2. 2023. https://usethis.r-lib.org.
[6] H. Wickham, R. François, L. Henry, et al. dplyr: A Grammar of Data Manipulation. R package version 1.1.4. 2023. https://dplyr.tidyverse.org.
[7] H. Wickham, J. Hester, W. Chang, et al. devtools: Tools to Make Developing R Packages Easier. R package version 2.4.5. 2022. https://devtools.r-lib.org/.
[8] Y. Xie. bookdown: Authoring Books and Technical Documents with R Markdown. Boca Raton, Florida: Chapman and Hall/CRC, 2016. ISBN: 978-1138700109. https://bookdown.org/yihui/bookdown.
[9] Y. Xie. bookdown: Authoring Books and Technical Documents with R Markdown. R package version 0.36. 2023. https://github.com/rstudio/bookdown.
[10] Y. Xie. Dynamic Documents with R and knitr. 2nd. ISBN 978-1498716963. Boca Raton, Florida: Chapman and Hall/CRC, 2015. https://yihui.org/knitr/.
[11] Y. Xie. formatR: Format R Code Automatically. R package version 1.14. 2023. https://github.com/yihui/formatR.
[12] Y. Xie. “knitr: A Comprehensive Tool for Reproducible Research in R”. In: Implementing Reproducible Computational Research. Ed. by V. Stodden, F. Leisch and R. D. Peng. ISBN 978-1466561595. Chapman and Hall/CRC, 2014.
[13] Y. Xie. knitr: A General-Purpose Package for Dynamic Report Generation in R. R package version 1.44. 2023. https://yihui.org/knitr/.
[14] Y. Xie and J. Allaire. tufte: Tufte’s Styles for R Markdown Documents. R package version 0.13. 2023. https://github.com/rstudio/tufte.
[15] Y. Xie, J. Allaire, and G. Grolemund. R Markdown: The Definitive Guide. Boca Raton, Florida: Chapman and Hall/CRC, 2018. ISBN: 9781138359338. https://bookdown.org/yihui/rmarkdown.
[16] Y. Xie, C. Dervieux, and E. Riederer. R Markdown Cookbook. Boca Raton, Florida: Chapman and Hall/CRC, 2020. ISBN: 9780367563837. https://bookdown.org/yihui/rmarkdown-cookbook.
[17] H. Zhu. kableExtra: Construct Complex Table with kable and Pipe Syntax. R package version 1.3.4. 2021. http://haozhu233.github.io/kableExtra/.