![Can the nucleotide content of a DNA sequence predict the sequence accessibility?](/rails/active_storage/representations/proxy/eyJfcmFpbHMiOnsibWVzc2FnZSI6IkJBaHBBbkVOIiwiZXhwIjpudWxsLCJwdXIiOiJibG9iX2lkIn19--7f7a78aa87e1e646507b62f8b6b5c083594c4ecf/eyJfcmFpbHMiOnsibWVzc2FnZSI6IkJBaDdCem9MWm05eWJXRjBTU0lKYW5CbFp3WTZCa1ZVT2d0eVpYTnBlbVZKSWcwMk1EQjROakF3UGdZN0JsUT0iLCJleHAiOm51bGwsInB1ciI6InZhcmlhdGlvbiJ9fQ==--52131d7e1d9fc5c464ef2fd793e4b0873b571622/warren-umoh--qycBqByWIY-unsplash.jpeg)
Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
Read More...