I'm doing my first ever R course. One of the exercises is to create a random DNA strings with 1000 bases and for it compute the GC percentage (GC%).
I created the DNA bases vector and tried to create the sequence, however, the result is not correct
DNA <- c("A","G","T","C")
seq <- strrep(DNA, 250)
Any suggestion for this noob?
Basically you should check out very basics of R
, i.e sum
, sample
, paste
functions. The solution is quite simple:
DNA <- c("A","G","T","C")
chain <- sample(DNA, 1000, replace = T)
GCcontent <- sum(chain %in% c('G','C'))/length(chain)
# chain as a string
paste(chain, collapse="")
Cet article est collecté sur Internet, veuillez indiquer la source lors de la réimpression.
En cas d'infraction, veuillez [email protected] Supprimer.
laisse moi dire quelques mots