Monarch geneset OGS2.0

DPOGS203296
TranscriptDPOGS203296-TA3282 bp
ProteinDPOGS203296-PA1093 aa
Genomic positionDPSCF300003 - 1289947-1356539
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0063770.072.80% 
BombyxBGIBMGA012247-TA0.074.26% 
DrosophilaCG32809-PD0.046.84% 
EBI UniRef50UniRef50_D6WJQ10.049.19%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WJQ1_TRICA
NCBI RefSeqXP_001966421.10.046.86%GF22008 [Drosophila ananassae]
NCBI nr blastpgi|2700066650.049.19%hypothetical protein TcasGA2_TC013023 [Tribolium castaneum]
NCBI nr blastxgi|2700066650.045.92%hypothetical protein TcasGA2_TC013023 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[181-260] IPR0227821.4e-10Actin interacting protein 3, C-terminal domain
Orthology groupMCL14110 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203296-TA
ATGTTAAGTAGGTGGAAAAGTAAAGACAAGTCAGAGAAATCTGGCAAGTCTTCAGGTTCGAGTAGCAAGAAGAAAAGAAAAGGTCGCGACAATGAAGAGGACTGGCAAAGCGATCCTAGACTTGACGGTGGTCAGTCTGACACAGAGGCAGGACGGGGGCCGCCGCGGGGTCGGCGTAACCGACGAGATGATCCCCGACGACACACACTTGCTGGCAACCATTTGTATACACAGCAAAGCGCGATTACTGGACTCGATGAGGAGAGATATCGTGTTGCGAACGTGAACATTCCAGAATACGCTGTTCCTGATAAAAATCGCATTATGAAGCCTTATCCACAAATATATCCTAGAATGAAAAAGCCGCCACTACCTCGGTATCTGCCGGCTCCCAGCAACATGTTGTTTGAGGACGATCCGGGCGTAATGTCCGAGGTTGAAACCTCATCCACTGGATTTAGACGCGGAGGGAAACAACGGTCTTCACTACCCGTTGTTCGTACCCCTAGCAAAACACTGGAGCGCCCCCTGGGTTTGGTTTTTCTACAATATCGTAGTGAGACCAAACGGGCGCTCCTACCAAATGAAGTCACCTCTATCGATACTGTAAAAGCACTTTTTGTAAGAAGTTTTCAAAAACAATTAACTATGCAATACATGGATAGCGATCACGTGAAGATATACATACACGACACATCGAAAGATATGTTCTACGAACTTGAAGACGACAGGGCACATCTACGGGATATTCGTGATCGCTCTGTACTACGACTATTTGAGGCAACAGACGTGGGCGGAGGAGTTTTTCCCGGTTCAGTCGGGGCAAGCTCCGTGGCCATGCCTTCAACTTCGTCCTGGGATCAGGACCAGAATTACTTCAGCGAACCTGAGATCGACTCGGACTACCATCACCAACATGTCCATAACAAGAGCAAAGTAAAGGCTCCTGCTTACTACATGGGCACCCCTGCTCCTAGTACCCTTCCGCGTGGAGGGTCTCTTCTTCGTGCTTTTTCTCCCGCTGCTCCCCCTGTTCCAGCTGATCGAGTCAAAGCTTTGCCACCAGGTCCTACAGCGCCCAGCAAACCGGTCCGCTCGTACATCCCGGGAAGCGGATCGGAGCGGTGTTTCCCCTTTGCCTCAGCTGAATGCCCTCGCGGTACCGACCAACTATACTCTATCCCTGCTCATTCTGGCGATGGATATATGTCGTCTCCCGAACGCAGCGCGCCACGGCCTTATGAAGACCCTTACTACACTCAATTCCTTGGCCCATCTGCAGGAAGGACTGGAATCGCTCCAGTTATTGACGAAGAAGCCAATGACTCGGTGATAATAGATGATGCATACCAAATGTATGGCGTAAATGCAATAACAACAGCACCTCGGCCCATGCCGCGGCCGGGCCCATTCGAAAGGCTTGGCGCGCCACTCGAAGATATGCAGCGTCTCCGTGTAGAGAAGATGGAACGTCAGCTCGCAAACCTTACGGGACTCGTACAGAAAGCGCTGCAAGTCCCAGCGGTGGTGCCCGCACCTCCTGCTGTAGCTCCTAGACAGGAGTATCAACCCTACGCTGCAAGACCAGCTACGCAAGATGCAGCACGATTTGCCAGCACCGAGAGACCTCCAAAATTAGGCAAAGACAGATTTCAAAAATCTGTCTCTTTTGAGAAATCGGTATCCTTTAGCGATGACCCACCAGATATGAATTCTCCCAAACAGCATTCGCCTCAACACTCAGAACGTGATCGCTTGAAGCCTGCCCCGCCGCCCAAGCCAGTTGGACTCGTGGGCCAGCAGCTTCCCTTGCCACCTCAGAAGACCTGCACTCTTAACGTCAATCCAGATTTCTTTAATCAACTACGGTCTCTACAGAAGCAAAGCCGTGATCTTCGTATTGAGACTCGCAATCTTCGTCGGTCTACTTTAAATCAATCCATGCAGATGAGGCAATTGATGGCCGACACCATTACAAAAATAGGTGCCATCGCTGCGAGCTTTTGTCAGGAAGATCCAGACTCGCAACTAACTCGCGAAGAAGAGATCTATCGCCAGGATATGCTGCTATTGGAGAACGATTTGTATGAATTGGAAGCCACAGTTGAACGTCTGCGTGGCCAAGCGGCAAATAGGGAAACCCGCGTCAATATGGCGGACATCGAGCGCATTGCAATGGTGCTTTCAAAGAGCAGCAAAACCGTAGCCGACTGGAAACTGAAATTCCCGATTCTTCAAGAAACAATGAAGACTAAGCTTGCTGGTGAAATGGAAAAGGTGGTCCGTAAGGAGAAAATGCTGGAAGATGAACCAGAGCGTTTGGAGCTTGCTTTGCGTCGCTGCAAAAAATTGACAGGCACGCTTGTGACTTTGAAGAGGTTGGCTTGCGTCCAAGAACAGCGCCTGCCGGTCGGCGACGGCCGCGTGTCTCCGAGCTCATCCCAGAGCTCCATCACTGTGGGCCCCTCATCTGAGGAATATACTTCCGAAACCGCATGTGCAAGTGCGCCCACCAACAAGGCCGCCAGCGGCGGGCGAGGTGCTGCGCCCCGGGGAACAACGGAGCTGCGGCCAGAGAACGCCCTCGATGCGTTACTAGACGAACTGCAAACTTTCGCCAAACCGGCGGAACGCGGCGGGCGCGGCGACCCCCGAGACCCTCCTCTGCGCCGCCTTCACTCTTACCCCAGCGGCAGCGATACGGATGCTTCGCCCCCGGTACGTGCTCGCGGCCAACACCCTCCAAAGCCTCCAGTACCGGAGCGACATCCTGAACTTCTAGCAATGGCCGTCCGTCGCGCCCCGCCGCCGCCGCCGCCCCGCACCACCTCCCGCTCCCCGCTGGCCTCACCAACCTCACCACCCTCGCCCCCTTCGCCGCACTCCCCCCGCTCACCCTCTTGCCTAACTCACACCGCTGAAATCAACGATGACAAAAACGCCTCGCGTCAAGCTCTCCTAGAACAGCGCCACCAAGAGCTCCTTAAAAAACAAAAGGCACTACAAGAACAATACGCTAGACTGCAAATGATCCAACGCAGTGGGCCCACATTACCGATTAACGCCCAACCTGATCTTAAAAAAACTGGTAGCGAATCTAATCTCCTCACAAAATTAAATCTTAATCTAGCACCAGCTAATATGTCCGGCAGTATGACACATTTAGCAGGAGAAAGTAAAAAAAAACAAAACGATTTAACTTCAGAGCAACAAAATCCTAAGGAGACAGTGCCTGACGCGATGGCCACTACTAATAAGGTTTACGAGACTGACATACTGTGA

Protein sequence:

>DPOGS203296-PA
MLSRWKSKDKSEKSGKSSGSSSKKKRKGRDNEEDWQSDPRLDGGQSDTEAGRGPPRGRRNRRDDPRRHTLAGNHLYTQQSAITGLDEERYRVANVNIPEYAVPDKNRIMKPYPQIYPRMKKPPLPRYLPAPSNMLFEDDPGVMSEVETSSTGFRRGGKQRSSLPVVRTPSKTLERPLGLVFLQYRSETKRALLPNEVTSIDTVKALFVRSFQKQLTMQYMDSDHVKIYIHDTSKDMFYELEDDRAHLRDIRDRSVLRLFEATDVGGGVFPGSVGASSVAMPSTSSWDQDQNYFSEPEIDSDYHHQHVHNKSKVKAPAYYMGTPAPSTLPRGGSLLRAFSPAAPPVPADRVKALPPGPTAPSKPVRSYIPGSGSERCFPFASAECPRGTDQLYSIPAHSGDGYMSSPERSAPRPYEDPYYTQFLGPSAGRTGIAPVIDEEANDSVIIDDAYQMYGVNAITTAPRPMPRPGPFERLGAPLEDMQRLRVEKMERQLANLTGLVQKALQVPAVVPAPPAVAPRQEYQPYAARPATQDAARFASTERPPKLGKDRFQKSVSFEKSVSFSDDPPDMNSPKQHSPQHSERDRLKPAPPPKPVGLVGQQLPLPPQKTCTLNVNPDFFNQLRSLQKQSRDLRIETRNLRRSTLNQSMQMRQLMADTITKIGAIAASFCQEDPDSQLTREEEIYRQDMLLLENDLYELEATVERLRGQAANRETRVNMADIERIAMVLSKSSKTVADWKLKFPILQETMKTKLAGEMEKVVRKEKMLEDEPERLELALRRCKKLTGTLVTLKRLACVQEQRLPVGDGRVSPSSSQSSITVGPSSEEYTSETACASAPTNKAASGGRGAAPRGTTELRPENALDALLDELQTFAKPAERGGRGDPRDPPLRRLHSYPSGSDTDASPPVRARGQHPPKPPVPERHPELLAMAVRRAPPPPPPRTTSRSPLASPTSPPSPPSPHSPRSPSCLTHTAEINDDKNASRQALLEQRHQELLKKQKALQEQYARLQMIQRSGPTLPINAQPDLKKTGSESNLLTKLNLNLAPANMSGSMTHLAGESKKKQNDLTSEQQNPKETVPDAMATTNKVYETDIL-