Monarch geneset OGS2.0

DPOGS203545
TranscriptDPOGS203545-TA2286 bp
ProteinDPOGS203545-PA761 aa
Genomic positionDPSCF300055 + 329970-380560
RNAseq coverage2055x (Rank: top 6%)
Annotation
HeliconiusHMEL0132080.080.22% 
BombyxBGIBMGA004353-TA0.066.50% 
Drosophilafus-PD2e-15343.99% 
EBI UniRef50UniRef50_Q0IEC28e-17154.90%Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q0IEC2_AEDAE
NCBI RefSeqXP_970801.20.053.31%PREDICTED: similar to fusilli CG8205-PD [Tribolium castaneum]
NCBI nr blastpgi|2700099420.055.88%hypothetical protein TcasGA2_TC009268 [Tribolium castaneum]
NCBI nr blastxgi|2700099429e-17747.69%hypothetical protein TcasGA2_TC009268 [Tribolium castaneum]
Group
Gene OntologyGO:00001663.5e-06nucleotide binding
KEGG pathway 
InterPro domain[394-575] IPR0126773.5e-06Nucleotide-binding, alpha-beta plait
Orthology groupMCL11356 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203545-TA
ATGATTAACGGAGGCTGGGTGTGCGCGTTGAATGTCCGTACGGCGGGTCTAGGGGGCGCAGCGCTGGGTTCCGACGAGGAAGAGGTTGTCTATCTAGCCTATGTCGTAATAGACGTGCTCACCAACCAGGTGATTGGTGAGAGGGAGTATGCTGTCCGTCCGACGAGGAGGCCCTCCGAAGAGCTGCAGACTGGTCAGCCCCTGGATGTAGTGGTGCAACAGGTGGACGAGTTCGTCCACTCCCTGCAGGTGGACCCTCTCTCTCCACTGTTCAGACTCGTCACGGACGGTCAGCCGCCGCTCAGACAATGTCTACATCCGGAAGCGTGCTCGAAGGATATAACTCTGCCGCCCTACTACGCCCGCTTCCACGACCTCCGCAAGGAGTACGTGAGGGCGTACACTCTCCGCGCAGTCACCAGGTCCCAGCCGCCACCTCCGGACCATCCTAACAGCATCTCCGACATGATGGGATATGTAAGCGTTCCTTATAGTTGTATAGATCTAGGCATAACTCCCTACACCGGAGACAACTTCTATGCAGCCGAGGTCAAGGATATGGCGGCCATTATACAGAGGATCATAGCTGATGGCTTCAGACTAGAACTACCAGAAACGATCGACCTGGTCCTAGAAACTGGAATATGGCTGATAAGTGACGACCGTGCACCTCTACCACGATACTGTTTACATATCGTTAAGATATCGTTATATCGCATGGTAGCCCTTACAAGCGGCGAGTCGCGCGGGGCTCATTCTTCGTCCGGCTGTGTTGTTGTGTGGACGTGTGTTGTGTGGAGTGCTGTATTGAACTATCGCGTCAAGATAAAAAGCTCAAAGGACGATGAGATTGATGGGAACTGCATAGTGAGAGCTCGTGGTCTTCCTTGGCAGTCTTCGGACCAGGATATAGCTAAGTTCTTCAGGGGACTCAATGTTGCTAAGGGTGGGGTAGCTCTTTGTCTATCACCTCAGGGCAGACGGAACGGGGAGGCGTTGGTGCGTTTCGTGTCTCAGGAACACAGGGACATGGCGCTCAAGAGACACAAACACCACATCGGACCGCGGTACATCGAGGTGTACCGCGCCAGCGGTGAGGATTTCCTGTCAGTGGCCGGTGGCGCCACCTGTGAGGCTGCGGCCTTCCTATCTCGAGGGGCTCAGGTCATAGTGCGGATGAGAGGACTGCCCTACGACGCGACACCGCAGCAAGTTCTGGAGTTCTTCTCGTCGGGCGAGGAGCCGGTGCAGGTGTTGGACGGTGCGGACGGAGTGCTGTTCGTGCGCAGAGCTGACGGCCGGGCCACCGGGGACGCCTTTGTGCTGTTCAGCAAGGAGGCCGACGCTCCCAAGGCCCTCGCCAGGCACCGGAAACTCATAGGAGCCAGATATATCGAGCTCTTCAGGAGCACAACCGCTGAGGTCCAGCAGGTGTTAAATCGCTCGCTCGAGAGTCGCGGCCAGACGCCTGGTGCCCAGGAGCTGGTCCCCGTGACGCTCGTCCCACAACACGTTATCACTTCCGGTACAGCCAAAGACTGTGTGGAGCATATCTTGACCTTCCTGGACGAGTTCGCGAAGAACATTGTGATGCAGGGAGTTCACATGGTTTACAACGCGCAGGGTCACCCGAGTGGAGAGGCCTTCATCCAGATGGACAGTGAAGCCAGCGCCTTCCTCTGCGCCCAGCAGAAACACCACCGGTACATGACCTTCGGCAAGAAACAGAGATACATCGAGGTGTTCCAGTGCTCCGGGGACGACATGAACCTGGTGCTCACGGGCGGCGTGGGTCCTTCGCCCCCCAAGGTGTTGTCCCCCGGTACGTGTCTGCCCCCCGCGCCCCGGCTGCTCGCGCACCACATCGCTCAACAGAGCCTGCTGGCCCGACAGCACGAGAACCTCCTGCTGTCCATGCGGCCGCCCGTCCTGCCGCTCGTGCCCCTCCGCCCGCCCCCACCCGCTCAGACCTACCCCATACTCCCCCAGTCCATCCTCCCCGCCAAACGCAGCTACGACCGCGCCTTCGCCCCCGACCCGTCCCCCGCCAAGCGCCAGTACATCCAGCCGTCTCTCCCGCTGCCGATGCTCTCGTACTCCACGTCCCCGTCTATATTCTCGTACGCCAACACCGGGCCGATGTTTTCGTACAGCGCCCTGCCCGCCCTGCCGAGCGTCCCGGGCTACGGCACGGCGCTGTCCGCGGCTCTCCCGCCCACCATCACCAGCCCCTTCCCCGGCTCTCTGTCCATGTTCCCCACCTACCCCTACTACCCCGGGGTGTAG

Protein sequence:

>DPOGS203545-PA
MINGGWVCALNVRTAGLGGAALGSDEEEVVYLAYVVIDVLTNQVIGEREYAVRPTRRPSEELQTGQPLDVVVQQVDEFVHSLQVDPLSPLFRLVTDGQPPLRQCLHPEACSKDITLPPYYARFHDLRKEYVRAYTLRAVTRSQPPPPDHPNSISDMMGYVSVPYSCIDLGITPYTGDNFYAAEVKDMAAIIQRIIADGFRLELPETIDLVLETGIWLISDDRAPLPRYCLHIVKISLYRMVALTSGESRGAHSSSGCVVVWTCVVWSAVLNYRVKIKSSKDDEIDGNCIVRARGLPWQSSDQDIAKFFRGLNVAKGGVALCLSPQGRRNGEALVRFVSQEHRDMALKRHKHHIGPRYIEVYRASGEDFLSVAGGATCEAAAFLSRGAQVIVRMRGLPYDATPQQVLEFFSSGEEPVQVLDGADGVLFVRRADGRATGDAFVLFSKEADAPKALARHRKLIGARYIELFRSTTAEVQQVLNRSLESRGQTPGAQELVPVTLVPQHVITSGTAKDCVEHILTFLDEFAKNIVMQGVHMVYNAQGHPSGEAFIQMDSEASAFLCAQQKHHRYMTFGKKQRYIEVFQCSGDDMNLVLTGGVGPSPPKVLSPGTCLPPAPRLLAHHIAQQSLLARQHENLLLSMRPPVLPLVPLRPPPPAQTYPILPQSILPAKRSYDRAFAPDPSPAKRQYIQPSLPLPMLSYSTSPSIFSYANTGPMFSYSALPALPSVPGYGTALSAALPPTITSPFPGSLSMFPTYPYYPGV-