Monarch geneset OGS2.0

DPOGS209316
TranscriptDPOGS209316-TA3069 bp
ProteinDPOGS209316-PA1022 aa
Genomic positionDPSCF300234 - 42532-50134
RNAseq coverage477x (Rank: top 26%)
Annotation
HeliconiusHMEL0180735e-9668.06% 
BombyxBGIBMGA013817-TA0.055.64% 
DrosophilaNox-PC0.052.01% 
EBI UniRef50UniRef50_D6WP310.048.66%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WP31_TRICA
NCBI RefSeqXP_972375.20.050.79%PREDICTED: similar to AGAP008072-PA [Tribolium castaneum]
NCBI nr blastpgi|1892391620.050.79%PREDICTED: similar to AGAP008072-PA [Tribolium castaneum]
NCBI nr blastxgi|1892391620.050.61%PREDICTED: similar to AGAP008072-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055093.4e-33calcium ion binding
GO:00551142.3e-25oxidation-reduction process
GO:00164912.3e-25oxidoreductase activity
GO:00090555.8e-16electron carrier activity
GO:00055065.8e-16iron ion binding
GO:00506605.8e-16flavin adenine dinucleotide binding
GO:00160215.8e-16integral to membrane
KEGG pathway 
InterPro domain[130-294] IPR0119923.4e-33EF-hand-like domain
[837-1010] IPR0131212.3e-25Ferric reductase, NAD binding
[543-622] IPR0131128e-17FAD-binding 8
[372-502] IPR0131305.8e-16Flavoprotein transmembrane component
[533-614] IPR0179381.1e-14Riboflavin synthase-like beta-barrel
Orthology groupMCL11309 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209316-TA
ATGACAGCCGGTGACGACACTGACACCTGTCACATGACAGCTGATCGCGTTGACAGCACACACCCACGTGCTACGAGCATTTTATCAGACTTGTTGGAGCGAATCTTTCGTTTGTTCTGTGATGAAGAACATCTCGTCCAGGAGGAGTGGATAGAAGCACTCAAGGAGAGGTTACAAGAAGACAAACAACTCGACTTCGTGGAGCAAGTCGAGAGCGTCGCGTACGTACTCTGTGGAGACGGAGTGGTCACGAAGGAGACGTTCAGTAAGATATGGAAGAACAAGGTGGTGACGAACAAGATGTTCCGCGCGGTGGAAGGCGGGTCAGGCTCAGTGGGCGCCTCGGACCTCGTGGACCTCATCACGGCCGCCACCGACAGCAGAACCAGAGCCGGTATAGACAAGGAGTCATTGGATCGTTTGGAAAGATTGTTCAAGGAGACGGTCGGGGACCAGAGAGAGATCACCAGAGAACAGTTCCAGAAGATCGTCGTCTCCAAGAATCCCTTCTTCACTGAACGCGTGTTTCAAATCTTCGACGAAGATGACTCCGGATCCATCTCCCTCCGCGAGTTCCTCTCAGCGGCCGCTCGTGTCGCTTCGCGGTCACCCAGAGACAAGCTGAGGTTTCTCTTCCAAGTGTACGACCTGGACGGTGACGGGCTGATACAACACCGTGAACTGCAGCACGTGATGAGAGCCGCCATGGAGGAGAACAACATGAGCTTCTCAGAGGAACAGCTCCGAGACCTCACTAGCGCTCTGTTCGAAGACGCGGATGTTGATGGGCGAGGAGCCATCACGTGTGACGCGCTCCGAGACCAACTCGACAGACACGGGGGACTATTAGACAATCTCTCCATCAGTATAGACCGCTGGCTGGTGCCTCCTCCTCAACCGGAGCCTCTCGCCCTGCGCCAACGACTCGCCAACATGAGACCTTACCAACTGTCTCTAAACTATCTGAGGAACAACTATGTTTTCGTCTCCTACTTGGCGTTCTTCTTTCTCATGAACGTAGCTCTCTTCATTGCAAGGATTGTCGAATACTGGGATCACAATATTTTTGTTAAATTTGCTCGAGCTTCAGGCCAGTGTCTGAACTTCACATCATCGTGGGTGTTGGTGCTGATGCTGCGTCGAACCCTGACCTCTCTCCGTGAGCGCGGGGTGGTGGGAGCTCTCCTCGACCGTCACGTCTCCCTCCACAAGCTGGCGGCCCTCACCCTCATTGCACACGCCGCCCTGCACTCCGTTATGCATTTCTGTAACTTTGCGCTGGTGGTGGTGCCGCCCTCCCCACTACCACTCCACGTGTGGTTGCTGACCTCTCGTCCGGGAGTCTTCGGTCTGATTCAAGGCTGGGCTAACCCCACGGGTGTAGTTCTGGTGGTGGTGCTGGCAGCCTTGGCGTACTCCTCGCGACCAGCAGTCAGGAGAAGGGGTTGCTTCGAGGTGTTTTACTTCACCCATCTCCTGTACGTTCCGTTCTGGATTCTCCTCATCCTTCACGCACCCAATTTCTGGAAGTGGTTCATCGTCCCCGGAACCGTTTATCTCATCGAGAAGATGTTAAGGCTTTGGTGGTTGAGATCTGGTCGTGGCCGCAGTTACATAACATCCGGCGCCCTCCTTCCCTCCCGCGTCCTGCACGTGGCCGCACGCCGACCTCGAGGGTTCAACTTCCGCGCCGGAGATTATGTGTTTGTGAACGTGCCCGCTATAGCTCGATTCGAGTGGCATCCCTTCACCATCAGCAGTGCGCCCGAGGAGAAAGACTACATCTGGCTTCACATCCGAGGAGTGGGCGAGTGGACCAACAGACTGTATGAACACTTCGAACAACAAACGAAACGAGAACAAATACAAGAAAAAACGCATCGTAGCAATTCAACCCAGAGCAACAAAAGTAATCAGAGTAACAAGAGCAATAAAAGCAAGAAGTCCACCAAGAAACCTTCCATAGACTCCGGATACACTAATGAAGCGTACACCATTGATGAAGAAAACGAAAATGGCTGCGGTCTCAACTCAACTCCACAAACCCGTCCTCCGCTCTTGTCTCCTCTTCGCTTCCTGGAGAAGTCTCGCTCGATGCCCGACGTGAGGAAAAGTTTAAAGAAGAAACGGTTCATGATTCCTGAGTATCACCGCTCCGAGTCAGCCACCTTCCCCCCGGCGCCGGCCCCCTCCCCCTTGCCGCGCCCCTCTGCTCGCAGCCCCGCGGGCGCCCTCCACCTCGCAAACAGCTTCCGTCACCTGAGGACTAAGCCAGCCATCGTGTCGTGTGAGACGCCGCCTCTTAAGAAAGAAAGCTTACTGACCAGCGCGAGACGTCGCCTGTCAAAGACGCTCTCTCCAGACAGAGACGATGATGACAGACACGAAGACCCGGAGGCGGGAGACGGCGCGGAACACGAGGACCAACATGGACATACATACATGAAACCGTTAGAGATGTACATCGACGGTCCGTACGGCGCGGCTTCCTCGTTAATGTTTTCGTGCGAGCACGCGGTCTTAGTGGCGGCCGGGATCGGGGTGACTCCGTTCGCGTCGGTGTTACGGTCCATCGCCCACAGACTGTCCCGGGCGCAACACGCCTGCAGCAACTGCGGCCAGCGCTGCGTGACCCCGGACGCCGCCGACATCACGCTCAAGAAGGTCGACTTCATCTGGATCAATCGAGACCAGAGAGCCTTCGAGTGGTTCGTGTCGCTACTGTCGTCTCTGGAGATGCAGCAGGCCGAGCTGGAGCGGGCGGGAGGCCGGAGGTTCCTCGATATGCACATGTACGTCACCAGCGCTCTGCAGCGCTCCGACATGAGGGCGGTGTCGCTGCAGCTGGCGCTGGATCTGATGCATGGCAAGGACCAGCGGGATCTGGTGACCGGACTTAAGACCCGCACCGCCGCCGGAAGACCCAACTGGGACGTAGTCCTCCGGAGAGTGGCCGCCGGGCGGCAGGGACCCATCTCCGTGTTCTACTGTGGGCCCGCTCCTCTCGGGCGAGTGCTCCGGGACAAGTGCTGCCGGCTCGGGTACGACTTCCGGAAGGAGGTCTTTTGA

Protein sequence:

>DPOGS209316-PA
MTAGDDTDTCHMTADRVDSTHPRATSILSDLLERIFRLFCDEEHLVQEEWIEALKERLQEDKQLDFVEQVESVAYVLCGDGVVTKETFSKIWKNKVVTNKMFRAVEGGSGSVGASDLVDLITAATDSRTRAGIDKESLDRLERLFKETVGDQREITREQFQKIVVSKNPFFTERVFQIFDEDDSGSISLREFLSAAARVASRSPRDKLRFLFQVYDLDGDGLIQHRELQHVMRAAMEENNMSFSEEQLRDLTSALFEDADVDGRGAITCDALRDQLDRHGGLLDNLSISIDRWLVPPPQPEPLALRQRLANMRPYQLSLNYLRNNYVFVSYLAFFFLMNVALFIARIVEYWDHNIFVKFARASGQCLNFTSSWVLVLMLRRTLTSLRERGVVGALLDRHVSLHKLAALTLIAHAALHSVMHFCNFALVVVPPSPLPLHVWLLTSRPGVFGLIQGWANPTGVVLVVVLAALAYSSRPAVRRRGCFEVFYFTHLLYVPFWILLILHAPNFWKWFIVPGTVYLIEKMLRLWWLRSGRGRSYITSGALLPSRVLHVAARRPRGFNFRAGDYVFVNVPAIARFEWHPFTISSAPEEKDYIWLHIRGVGEWTNRLYEHFEQQTKREQIQEKTHRSNSTQSNKSNQSNKSNKSKKSTKKPSIDSGYTNEAYTIDEENENGCGLNSTPQTRPPLLSPLRFLEKSRSMPDVRKSLKKKRFMIPEYHRSESATFPPAPAPSPLPRPSARSPAGALHLANSFRHLRTKPAIVSCETPPLKKESLLTSARRRLSKTLSPDRDDDDRHEDPEAGDGAEHEDQHGHTYMKPLEMYIDGPYGAASSLMFSCEHAVLVAAGIGVTPFASVLRSIAHRLSRAQHACSNCGQRCVTPDAADITLKKVDFIWINRDQRAFEWFVSLLSSLEMQQAELERAGGRRFLDMHMYVTSALQRSDMRAVSLQLALDLMHGKDQRDLVTGLKTRTAAGRPNWDVVLRRVAAGRQGPISVFYCGPAPLGRVLRDKCCRLGYDFRKEVF-