Please read the following:
s123456.R where
s123456 is your student number, with the questions clearly
marked with comments (using the # character). Please don’t
include any unnecessary code, only the code that is used to produce the
answer.The palmerpenguins data contains size measurements for
three penguin species observed on three islands in the Palmer
Archipelago, Antarctica. The data is available as an R package, but for
this exam, you can load the following CSV file:
https://mbdata.science.ru.nl/share/heeringen/R_exam_gbd/penguins.csv
How many observations does this data set contain?
On which island was the heaviest penguin observed?
Create a tibble with a new column that contains the penguin weight in kilograms.
Is there a significant correlation between bill length and bill depth for the penguin species Gentoo?
Create a barchart of the mean body mass per species.
Create a scatterplot of the penguin bill length against the body mass. Facet by species and color by sex.
We have the hypothesis that male Gentoo penguins will have a higher body mass than female Gentoo penguins. Use everything that you have learned to test this hypothesis. What is your conclusion?
In June 2021, Hotaling et al. published an inventory of insect genome assemblies. The accompanying data contains information such as assembly size, contig N50 and the type of sequencing technology that was used.
You can load this data set from the following CSV file:
https://mbdata.science.ru.nl/share/heeringen/R_exam_gbd/insects.csv
The contig N50 is a measure of genome quality. It is definied as the sequence length of the shortest contig at 50% of the total genome length.
The column BUSCO_complete contains information on gene
annotation quality. The BUSCO gene set contains genes that should be
conserved in all insects. The BUSCO_complete column defines
how many of these genes are present in the given assembly.
Which order has the largest mean genome size?
The following is a figure from the paper. Re-create this figure as best as you can. Note: you don’t have to re-create the specific grouping of the X-axis, you can group by order. In addition, you do not have to create the grey background shading.