Monarch geneset OGS2.0

DPOGS205091
TranscriptDPOGS205091-TA2004 bp
ProteinDPOGS205091-PA667 aa
Genomic positionDPSCF300074 + 310537-333585
RNAseq coverage737x (Rank: top 18%)
Annotation
HeliconiusHMEL0057492e-13963.64% 
BombyxBGIBMGA006884-TA2e-15583.11% 
DrosophilaCG32264-PD4e-9448.91% 
EBI UniRef50UniRef50_UPI0001791A572e-11846.89%UPI0001791A57 related cluster n=1 Tax=unknown RepID=UPI0001791A57
NCBI RefSeqXP_001948235.14e-11946.89%PREDICTED: similar to phosphatase and actin regulator [Acyrthosiphon pisum]
NCBI nr blastpgi|2700129452e-12650.30%hypothetical protein TcasGA2_TC004311 [Tribolium castaneum]
NCBI nr blastxgi|2700129453e-15653.07%hypothetical protein TcasGA2_TC004311 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[463-486] IPR0040182.7e-09RPEL repeat
Orthology groupMCL15051 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205091-TA
ATGTCGCTGAGCGCGCCCGCTCAGCCGGTGCGCGGGACCAGGACCAGTAGCCTCGACCAGCTTGATTTCCAAGAGCGCAGACAGCTCATAGCTAGCTCGTTGTCCTTAACAGACTTTCTGCATGTTGGAGCTAAAGAAGTGGCTGCTGTTGCCGCCAAAAAACAGAATGGCTCCGCGGTGCGAACAAACAGCCTGGGTTCCGGCGCGCGCACCCCGCCCGCCGAAAGAACCAAGTCTAAGTTCTCCGCGTTCGGTAGACTTTTCAAACCCTGGAAGTGGAAACGGAAAAAGAAGTCGGAGAAATTCGAAGCAGCCTCTAGAACTTTAGAAAGAAAAATATCCGTCCGAGCGAATAGAGAAGAGTTAGTTCAAAAAGGTATTCTCATGCCGGAGAGTCCTGTTACTCCCGTTCCGGAAGCCAACGAGGATGTCTCCCCGGGCTACGAGGAGGGTCGTGTTGACCGCGAGTGTGTGCCGGGCGGCGCGACACTCAGCGAGCGCTCGCTCGGCATGCATGCAGCACTACAGGCCCAGCTGGCAGCACAGCTCGCACAGCAGCAGCTAGCACACCACCACCAGAACAATAATATCATCGAACCAAAGAAAGATAAATTGGAGAATGGTGGTGAAATGGTGAGGCCAGAGAGACCGAACTCATTGTCAGCCGGCAAGCTCCCAAGACGGATGTGCTATCAAGGAGACGTTGTGGGCGGTTATGGGGGCGGCGTGGGCAGCGTGGGTGACGGCGATGTCAGCGAAGAACAGCTCATGCTGTCGGAGCTGCCGGAGCCGCCCATCGCGCTGTCGGAGATCGGTCCCATCCCGCCTCCGCCCATGTTCAGCTCGCCGAGTCCCACCCGCCATCACCACCCTCACCAACATCCCCACCCTCACCCTCACACACACCCTCATCCTCTCTCCAATCAACTGTCTGATTCTGAAGACGAGGATTCTCAGGAAGAGGAGGAGGGTGAAGCCGTGGCCGGTGCGGACACGTCTCGGGTCGAGGAGATTCCTCCAAAGGAGCCCTTATATAACGCGGTGCCGCTCAAGTCCGCCCTTAAGAAGCGACCGGCCGCCTCGCCCGCTGGGACTCCTCTCGCCACGCCGCTCGCACCGAGACAGGATCACCATCACGCCAGTTTTAACAGCCGTCCGGTGCGCGTGGGCAATGCCACGGATAACAAAGAGAACGCTCGACCCTGGGAAGGTGACTACAGCGAGTACTCGAGCGAGTCTGAAAGAGTAGCCGCCAAGTTAGCCCGTAAGGAGAGCCTCAACATCAAACTGGCGCTACGACCGGACAGACAGGAGCTAATTAATAGGAATATCCTGGTGGTGCAGAGCGAGCACGAGCGTCAGGAGTCTTGGGAGGCCATCGGCGCGAGACTCATACGTCGCCTGTCAATGAGACCCACTGCTGAGGAATTGGTAGAGAGGAACATACTCAAGAGTCAGTCACCAGCGGAGGAGAAGAAACAAAAGGAAGAGAAAAAGCGTTACCTCCTGCGTAAGCTTAGTTTCCGACCCACCGTCGACGAACTGAAGGAGAAGAAAATCATAAGATTTAGTGATTATATAGAAGTGACACAAGCGCACGATTACGATCGTCGCGCTGATAAGCCTTGGACGCGCCTGACGCCTCGTGACAAGGCAGCCATCAGACGGGAGCTGAACGAGTTCAAGAGCTCCGAGATGGCGGTGCATGAGGAAAGCAAACATCTTACCAGCATTTCAAGGACCCAGCTAGAGGAATACGAGAGAATGAAATTCACTATCCTTTTTAACACAGTCTTACAAAATTCCATAGACCATGACGGCTGTGCCCTAGTCCTAGTAAGGAGAGCGGCCTGCGGGTCTTCGGATGAGCGTGCGCCCCTCGGCCCTCGGCCCTCGCCCCGCACTGTGATTTATTATATTCATATTGACCTCTTTCGCATTCGGAAGTTTGTATGCAATAACACGAGAGGCGGCGGCGGAGCGACCCATACCGTCACCCGCTAA

Protein sequence:

>DPOGS205091-PA
MSLSAPAQPVRGTRTSSLDQLDFQERRQLIASSLSLTDFLHVGAKEVAAVAAKKQNGSAVRTNSLGSGARTPPAERTKSKFSAFGRLFKPWKWKRKKKSEKFEAASRTLERKISVRANREELVQKGILMPESPVTPVPEANEDVSPGYEEGRVDRECVPGGATLSERSLGMHAALQAQLAAQLAQQQLAHHHQNNNIIEPKKDKLENGGEMVRPERPNSLSAGKLPRRMCYQGDVVGGYGGGVGSVGDGDVSEEQLMLSELPEPPIALSEIGPIPPPPMFSSPSPTRHHHPHQHPHPHPHTHPHPLSNQLSDSEDEDSQEEEEGEAVAGADTSRVEEIPPKEPLYNAVPLKSALKKRPAASPAGTPLATPLAPRQDHHHASFNSRPVRVGNATDNKENARPWEGDYSEYSSESERVAAKLARKESLNIKLALRPDRQELINRNILVVQSEHERQESWEAIGARLIRRLSMRPTAEELVERNILKSQSPAEEKKQKEEKKRYLLRKLSFRPTVDELKEKKIIRFSDYIEVTQAHDYDRRADKPWTRLTPRDKAAIRRELNEFKSSEMAVHEESKHLTSISRTQLEEYERMKFTILFNTVLQNSIDHDGCALVLVRRAACGSSDERAPLGPRPSPRTVIYYIHIDLFRIRKFVCNNTRGGGGATHTVTR-