GitHub - maria-kromany/Finding-ORFs-with-Python

This project identifies Open Reading Frames (ORFs) in DNA sequences, which are potential gene regions that can code for proteins. ORFs start with "ATG" (start codon) and end with a stop codon (such as "TAA," "TAG," or "TGA"). By locating these segments, we can study gene functions and better understand DNA sequences.

The key tasks include: Parsing DNA Sequences from FASTA files, using the read_one_seq_fasta function to extract sequence data. Calculating GC Content via the gc_content function, which computes the percentage of guanine (G) and cytosine (C) bases. This helps filter biologically relevant ORFs. Identifying ORFs in both DNA strands. Functions like get_orf, one_frame, forward_frames, and reverse_complement work together to locate ORFs across multiple reading frames. Applying Thresholds with the gene_finder function to filter ORFs based on minimum length and GC content, narrowing down significant gene candidates.

The identified ORFs can be cross-checked with databases like GenBank to uncover biological functions, gene relevance, and human genome presence, connecting computational analysis to real genetic insights.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
X73525.fasta		X73525.fasta
gene_finder_test.fasta		gene_finder_test.fasta
human chromosome X partial.fasta		human chromosome X partial.fasta
human_chr9_segment.fasta		human_chr9_segment.fasta
project_01.py		project_01.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages