Monarch geneset OGS2.0

DPOGS201838
TranscriptDPOGS201838-TA1764 bp
ProteinDPOGS201838-PA587 aa
Genomic positionDPSCF300191 - 458558-462416
RNAseq coverage549x (Rank: top 23%)
Annotation
HeliconiusHMEL0147460.097.57% 
BombyxBGIBMGA006093-TA0.096.72% 
Drosophiladbo-PA0.085.89% 
EBI UniRef50UniRef50_G6CZH50.0100.00%Putative uncharacterized protein n=3 Tax=Coelomata RepID=G6CZH5_DANPL
NCBI RefSeqXP_001661146.10.086.40%BACH1, putative [Aedes aegypti]
NCBI nr blastpgi|1571277160.086.40%BACH1, putative [Aedes aegypti]
NCBI nr blastxgi|1571277160.086.40%BACH1, putative [Aedes aegypti]
Group
Gene OntologyGO:00055152.5e-33protein binding
KEGG pathway 
InterPro domain[1-575] IPR0170961.1e-292Kelch-like protein, gigaxonin
[258-569] IPR0159161.5e-85Galactose oxidase, beta-propeller
[140-242] IPR0117053.1e-42BTB/Kelch-associated
[15-134] IPR0113331.9e-37BTB/POZ fold
[29-134] IPR0130692.5e-33BTB/POZ
[38-135] IPR0002101.3e-32BTB/POZ-like
[431-477] IPR0066521.9e-18Kelch repeat type 1
Orthology groupMCL13608 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201838-TA
ATGGGTGAGGGCGGGTCCCCGGGCGGCGGGGCGCGGCTGTCTCACACGTCCGAGAAGCACCCGCGAGCCATCCTCGGGGAACTGTCCGCACTGAGGAGGCACAGGGAACTGTGCGATGTGGTCCTCAATGTTGCCAATAGAAAATTATTTGCACATCGCGTGATCCTGTCAGCGTGCAGTCCGTACTTCCGCGCCATGTTCACGGGCGAGCTGGCGGAGTCGCGCGCCACCGAGGTGACCATCCGCGATGTGGACGAGCACGCCATGGAGCAGCTGGTGGAGTTCTGTTACACGGCGCACGTGGTGGTGGAGGAGAGCAACGTGCAGGCGCTGCTGCCGGCCGCCTGCCTGCTGCAGCTGCAGGAGATCCAGGACGTGTGCTGCGAGTTCCTCAAGCGGCAGCTGGACTGCTCCAACTGTCTGGGCATCAGGGCCTTCGCCGACACGCATTCCTGTAGGGAACTGCTGAGGATAGCGGACAAGTTCACACAGCAAAACTTCCCAGAGGTGATGGAGAGCGAGGAGTTCCTGCTGCTGCCGGCCGCTCAGCTCATAGACATAGTGTCGTCCGACGAACTCAACGTGCGCTCCGAGGAACAGACCTTCCAGGCCGTCATGTCCTGGGTCAAGTACAACGTGGCTGAGAGGAGGCAGCATCTCGCACAGGTGTTACAACACGTCCGTCTGCCGCTCTTGAGTCCAAAGTTTCTAGTGGGGACCGTGTCCTCCGAGCTGCTGATACGATCAGACGACGCGTGTCGCGACTTATTGGACGAGGCCAAGAACTACCTCCTGCTGCCTCAGGAACGACCACTCATGCAGGGACCTCGCACCAGGCCCAGGAAACCCACGCGTAGAGGGGAGGTGTTGTTCGCGGTGGGCGGCTGGTGCTCGGGCGACGCCATCGCGTCCGTGGAGCGCTTCGAGCCCGCCACCGCCGAGTGGAAGATGGTCGCGCCCATGTCCAAGAGGCGCTGCGGCGTGGGCGTGGCCGTGCTGCACGACCTGTTGTACGCCGTCGGCGGCCACGACGGGCAGAGCTACCTCAACAGCATCGAGCGCTACGACCCTCAGACCAACCAGTGGTGCGGGGCGGTCGCGCCCACGTCCTCGTGCCGCACCTCCGTGGGCGTGGCCGTGCTGGATGGGGCGCTGTACGCGGTGGGCGGCCAGGACGGAGTGCAGTGCCTCAACCACGTGGAGCGGTACGACCCCAAGGAGAACCGCTGGACCAAAGTGGCCGCCATGACGACGCGGCGCCTCGGCGTTGCCGTGGCGGTTCTGGGAGGACATCTATACGCCGTCGGCGGCTCCGACGGCCAGTCCCCTCTCAACACGGTGGAGCGTTACGACCCTCGCGCCAACAAGTGGACGGCGGTGGCCCCGATGTCGACTCGCCGGAAGCACCTCGGCTGTGCAGTGTTCGACGGACAGATATACGCTGTGGGCGGACGAGACGACTGTACGGAGCTCTCCTCCGCTGAGAGGTATGAGCCGGCGACGGACTCGTGGTCGCCGGTGGTGGCGATGACGTCACGCCGCAGTGGCGTGGGCCTGGCTGTGGTCAACGGACAACTGTACGCGGTCGGAGGGTTCGACGGAACGGCCTACCTCAAGTCCATAGAGGTGTTCGATCCTGAAGCGAATCAATGGCGGTTGTGTGGAGCCATGAACTACAGACGTTTGGGAGGAGGAGTCGGCGTCATGAGGGCGCCGCACCACGATAACCATTACATATGGAATCGCAAAGACTCCGTGGTGTGA

Protein sequence:

>DPOGS201838-PA
MGEGGSPGGGARLSHTSEKHPRAILGELSALRRHRELCDVVLNVANRKLFAHRVILSACSPYFRAMFTGELAESRATEVTIRDVDEHAMEQLVEFCYTAHVVVEESNVQALLPAACLLQLQEIQDVCCEFLKRQLDCSNCLGIRAFADTHSCRELLRIADKFTQQNFPEVMESEEFLLLPAAQLIDIVSSDELNVRSEEQTFQAVMSWVKYNVAERRQHLAQVLQHVRLPLLSPKFLVGTVSSELLIRSDDACRDLLDEAKNYLLLPQERPLMQGPRTRPRKPTRRGEVLFAVGGWCSGDAIASVERFEPATAEWKMVAPMSKRRCGVGVAVLHDLLYAVGGHDGQSYLNSIERYDPQTNQWCGAVAPTSSCRTSVGVAVLDGALYAVGGQDGVQCLNHVERYDPKENRWTKVAAMTTRRLGVAVAVLGGHLYAVGGSDGQSPLNTVERYDPRANKWTAVAPMSTRRKHLGCAVFDGQIYAVGGRDDCTELSSAERYEPATDSWSPVVAMTSRRSGVGLAVVNGQLYAVGGFDGTAYLKSIEVFDPEANQWRLCGAMNYRRLGGGVGVMRAPHHDNHYIWNRKDSVV-