Monarch geneset OGS2.0

DPOGS209542
Transcript: DPOGS209542-TA, 2250 bp
Protein: DPOGS209542-PA, 749 aa
Genomic position: DPSCF300092 - 265861-277786
RNAseq coverage: 663x (Rank: top 19%)
Annotation
Heliconius: HMEL005430, 2e-80, 96.53%
Bombyx: BGIBMGA012411-TA, 5e-170, 91.77%
Drosophila: CG6966-PB, 0.0, 51.93%
EBI UniRef50: UniRef50_Q7PS93, 0.0, 57.95%, AGAP003839-PA n=14 Tax=Coelomata RepID=Q7PS93_ANOGA
NCBI RefSeq: XP_001811696.1, 0.0, 61.56%, PREDICTED: similar to sex-determining protein fem-1 [Tribolium castaneum]
NCBI nr blastp: gi|189234829, 0.0, 61.56%, PREDICTED: similar to sex-determining protein fem-1 [Tribolium castaneum]
NCBI nr blastx: gi|189234829, 0.0, 61.35%, PREDICTED: similar to sex-determining protein fem-1 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 1.3e-06, protein binding
KEGG pathway:
InterPro domain: [146-343] IPR020683, 1.8e-57, Ankyrin repeat-containing domain
                 [255-284] IPR002110, 1.3e-06, Ankyrin repeat
Orthology group: MCL11481, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209542-TA
ATGCAGTTGGAAAGGCTAGGGTTGCGTAACGGTTGTGTAACCAGTGAAAGAGTCTGCAGCCTGCAGTGTTGCATTCGCTTTGTCAGAGAGCGGACGTGCGCGCGCGCATATCGTTACCAGTACTCTGTTAATTGCCACGTGGCATTTGTGTATAGTAATGTGATAATGTCTAATGGTGTGTGGGACTTGATGGATTTGATGCCGGTGAAGTGCCGCTGGATTGAGAGGTTGAGGAGGTGCGGGAAGACTCTACTTCGAAGACAGCGGAGATTGATAAGACGCTTGAAGAAGTGTCGAAAGAACAAAGATGTCTACTGTCGCGACATTGAACCCGGCATACTTACGAAAGCCGCCGACAATGCTCATTGTGAAAGAGATCCCATGTACCTGCACAGCTGGCGTTCAGACGAGCTCAGCATGTTGGTGGGGGCGAAGGTGACGGGTGCTACGCCCCTAGTTATCGCCTGTCGCAACGGACACTACGACGTCGCTGAATACCTCATAGAGAGGTGCAAGGCTGATATTGAACAGCCCGGGTCGGTGACATTTGATGGGGAGACGATAGAGGGTGCTCCGCCCCTGTGGTGCGCGGCGGCCGCTGGTCACATGCCGCTAGTGAGATTGCTGGTGAGAGCTGGTGCTAACGTCAACTCAACTACCCGTACACACAGCACACCGCTCAGGGCAGCATGCTTCGATGGGCACTATGATATAGTGAAGTTCCTCGTTGAGAACGGTGCCGATATCGAGATAGCAAATCGTCACGGTCATACCTGTCTCATGATAGCCTGTTACAAGGGTCACATACAGATAGCCAAGTATCTGCTGTCGCTGAACGCTGATGTGAACAGGAAGAGTGTGAAGGGGAACACGGCATTACATGACTGCGCCGAGAGCGGCTCGCTGCATATACTGAAGATGTTACTCGCCCACGGCGCTACGATGGACGTTGATTCGTATGGCATGACACCGCTACTGGCTGCGTCAGTAACCGGTCACACTCACATAGTGGAGCTCCTCATCAACATAGAACACGGTCTTGTGACGAGGCAGGAACGGATAGACGCCCTGGAACTCCTCGGCGCCACTTACGTTGATAAAAGGAGGGATATGGTCGGAGCTTTGGCGCTATGGAAACGGGCTATGGCATACAGGTTTCCAGACGACGATCGTGAACCTATCCCCAAGCCGAAAGACGTTCCCCGTATAGAAGCTTACGAGTATGCAGTGGAGCCCGGCGATGCTCGCCAGTTGGAGGAATTGCTGGCCGACCCTGACGCCATGAGGATGCAGGCCCTCGTCATACGCGAAAGGATACTCGGCCCCGCCCATCCGGACACGTCATACTACGTCCGTTACCGAGGCGCGGTGTACGCGGACGCGGGTCGTTTCGGTCGCTGTCGTACTCTGTGGCACCACGCTCTGGACATGCAGCGAGCTGTCCTCCCTCCCCTGTCTCCCCTCACACAGTCCAGCTTGTATAGCTTCGCGGAGTTGTTCTCCTACATGCTGGCTGAAAGGACGAGACCGCCGCTTAGAGGTCGAATAGTTCCTCCGGTGACGTTCGAGGACATAGATCCAGTGTTCGTGAAAGCGTTGTCGGAAATACACAGGGGCATGGAGTTACTAGAGTCAAAATTAACTATCGACAGGGAGCAGACACTTGTGACGTTACAGCGAGTGCTAGTTATATCTCTACACCTTGCTGCGCTCATGGCAAGACTGCTGGAGGAGCCGGGGTGCACCGACGACGTCTCCAGGAAGATACATAAGGCTGTTTACTCACTGGTTAAATTAGACATTAAGGTCCGTCAAGGTCGCTCAGCGCTCCACGTGGCGTGTTCGTCTGAGGCAAGCCGCGGGCGGGGCGGGGCGGAAGCGGCGAGCTCGTGGACGGCGGAGGCTGCGTGTCCCGCGCTCGTCGCACTCATGCTGCGGCTCGGAGCGTCGCCGGACGTGCGCGACGCGGACGGAAACACGCCCTTACATCTAGTCTGCAA
GCTAAACCCGTGCCCTGCGGAGGTGGTTCGCGAGTTATTATCTCACGGAGCTCACATAGACACGGTCAACTATGAAGGTCAAACTCCTGAGGAGATATTAAAATCCACCCAACAGACGCTCTCGTCCATAGTGAACCCATTAAAATATACGACGCTCAAATGTCTCGCCGCGCGCACTGTTAAGAATTACAAACTACCCTACAGACATGTTGTGCCGCAGTGCTTACACTCGACTATAATTACTCATTAA

Protein sequence:

>DPOGS209542-PA
MQLERLGLRNGCVTSERVCSLQCCIRFVRERTCARAYRYQYSVNCHVAFVYSNVIMSNGVWDLMDLMPVKCRWIERLRRCGKTLLRRQRRLIRRLKKCRKNKDVYCRDIEPGILTKAADNAHCERDPMYLHSWRSDELSMLVGAKVTGATPLVIACRNGHYDVAEYLIERCKADIEQPGSVTFDGETIEGAPPLWCAAAAGHMPLVRLLVRAGANVNSTTRTHSTPLRAACFDGHYDIVKFLVENGADIEIANRHGHTCLMIACYKGHIQIAKYLLSLNADVNRKSVKGNTALHDCAESGSLHILKMLLAHGATMDVDSYGMTPLLAASVTGHTHIVELLINIEHGLVTRQERIDALELLGATYVDKRRDMVGALALWKRAMAYRFPDDDREPIPKPKDVPRIEAYEYAVEPGDARQLEELLADPDAMRMQALVIRERILGPAHPDTSYYVRYRGAVYADAGRFGRCRTLWHHALDMQRAVLPPLSPLTQSSLYSFAELFSYMLAERTRPPLRGRIVPPVTFEDIDPVFVKALSEIHRGMELLESKLTIDREQTLVTLQRVLVISLHLAALMARLLEEPGCTDDVSRKIHKAVYSLVKLDIKVRQGRSALHVACSSEASRGRGGAEAASSWTAEAACPALVALMLRLGASPDVRDADGNTPLHLVCKLNPCPAEVVRELLSHGAHIDTVNYEGQTPEEILKSTQQTLSSIVNPLKYTTLKCLAARTVKNYKLPYRHVVPQCLHSTIITH-
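
The transcript and protein lengths listed in this record can be cross-checked with simple arithmetic: 749 residues plus one stop codon correspond to 750 codons, or 2250 bp. The sketch below is a minimal illustration of that check, assuming the transcript consists only of coding sequence (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon. The function names are hypothetical and are not part of any MonarchBase tooling.

```python
# Standard stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp for a protein of the given length."""
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


def looks_like_complete_cds(nucleotides: str) -> bool:
    """Check for a whole number of codons, an ATG start, and a terminal stop codon.

    Whitespace and line breaks inside the sequence are ignored, so a FASTA
    body that wraps across several lines can be passed in directly.
    """
    seq = "".join(nucleotides.split()).upper()
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # 749 aa (DPOGS209542-PA) plus a stop codon.
    print(expected_cds_length(749))
```

To apply the second check to this record, pass the body of the DPOGS209542-TA FASTA entry (without the ">" header line) to `looks_like_complete_cds`.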