Monarch geneset OGS2.0

DPOGS202460
TranscriptDPOGS202460-TA2253 bp
ProteinDPOGS202460-PA750 aa
Genomic positionDPSCF300174 + 236865-240894
RNAseq coverage3428x (Rank: top 4%)
Annotation
HeliconiusHMEL0064362e-0617.47% 
BombyxBGIBMGA009978-TA4e-1941.89% 
Drosophila% 
EBI UniRef50UniRef50_D6WN311e-1788.89%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WN31_TRICA
NCBI RefSeqXP_001120943.11e-2084.62%PREDICTED: similar to Nopp140 CG7421-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3454804642e-1882.69%PREDICTED: hypothetical protein LOC100117990 isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1892377491e-11140.22%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[695-748] IPR0077182.8e-20SRP40, C-terminal
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202460-TA
ATGAGTGTTTTACCAGAAAATATTAAGGCTGAGGCCAATTCAATTATTCATCAATATTTAAATAGTATAGACAAATCCTTAGCGAGTAACTTTTTAAAAATAACAAAAGCGAAACCCAGAGCTAAGAATCTGCCGTCTTTTGTTGATATACTTCAACAATTCAAAAGCCAGCAAAAGCCCAACAAAAAACAAGCTGAGAGCTCTGATGACTCTAGCGAAGACGAAAAGCCTAAAAAGCCTGTTGCCCAAATAAATGGTAATGCCCAACAGAAAAAGAAGGAATCCTCTGATTCAGATAGCTCGGAAGATGAGAAACCTCAAAGCAAACCTGCACCCAAGTCTAACAATGTTGCACCGAAAGCTGCTCCCAAGAAAGCAGAATCATCAGATGACAGTGACAGTAGTGATGATGGTTCAAAGAAACCAGTAGCTCAGCCGGTGAAACCTCAAGCAAACAAGGCAGCTCCTAAAGTGGCTAAACAAGAATCTTCTGATGATAGCTCAGACAGTGAAGATGAAAAGCCAGCAACAAAACCTACCCCAGTCAAACCAGCTCCAGCTAAAGCCACACCAGCTAAAGCTACTCCAGCTAAAACTCCTCAAAAGAAAGCTGAATCTTCGGACAGTGACAGCAGTGATGATGAAGCTAAAAAGCCAGCTCCTAAAGCAACCCCAGCCAAGAAACCTGTTGCTAAACCATCAAAAGCAGAAGAATCGTCAGATGACAGTGATTCGGATGATGCTACCAAACCACCTCCAAAGGCTGCCACTGCCAAGCCCACACCAAATAAACCAGCAGTTAAGGCTACACCCAAAAAACAAGAATCATCTTCTGACGACAGCAGTGAAGATGAGAAACCCCAACAGAAAGCTTTCCCCAAACTGCAAGCGGCAGCTGCCAAACCAGTTGGCAAGGTCAACAAGAAGCAGTCATCCTCAGAAGATAGTGATGAAAGTAGTGAAGATGAAAAAGCCAAGGCTGCAGCGAAGCAACCAGCTAAACCTGCAGCCAAACCAACCCCAGCCAAGAAGAAACAGGAATCTTCTTCTGATGATAGTGATGATGAACCCCCAAAACCGGCCCAAAAATCTCCAGCCCCAGCCAAACCTGCAGCAAAAGCAAAGGCAGAATCTTCAGACAGCGATGACAGCAGTGACGATGAACCTAAAGCTAAGCAAGCTAAAGTAGCAACCCCGGCTAAACCTGCAAAGGAAGATTCTTCTGATGACAGTGATGAAGAGCCACCTAAGAAGGTTCAAAAACCTGGTGCTAAACCAGCAGCCGCACCTAAAAAAGCACCATCAGACAGTGACGATAGTAGTGAAGAAGAACAAAAGAAGCCACCGTCTACTCCAAAAGCCAAGCCAAAGCAAGAGTCGTCTGATGATGATGATAGTGATGATGAAGCACCAACTGAAGTACAGAAGAAGGAAAATAATCAGTCTGAAAACGTCAAAGCTGGCAAGAAAAGGAAAGCCTCTGAAAATGAAGCAGAGCCAACACCCGCCAAGAAACCATATAGCAATTTTGTAAAGGGGACCAATTGCTCATCAACACCCAGTGACAAAGGCCCAGAAAGTAAGCCAACTCCCTTCAGGAGAGTTGTCACAGAAAAAGTTGAAGTGGATCCTAGACTTAAAGACAATTCATTTGAAGCTAAGGAAGGTGAGGATGACGCCGAAGAAGACGATGAACTCAACAAATCTGGAGGCGGAGGTGGACGAGGGGGAATGAACCGTGGAGGCTTCCGCGGCCGGGGAGGGTTTAACGATAGAGGGGGCAGGGGTGGATTCGGCGGCAGGGGAGGCCGCGGCGGCTTTAATGATAGGGGAGGCCGTGGCGGATTTAACGATAGAGGAGGTCGGGGAGGTTTCAGAGGCCGTGGGGGATTCAACGATAGAGGGGGTAGGGGAGGACGAGGAGGGTTCGATCGTGAAGGTCGTGGTGGAGGTCGTTGGGGAGACAGAGGTGGGAGGGGAGGACGGGGTGGCCGCGGTGATAGACACTCCTGGGGAGGGGACAGGGGGGGATTCAACAAAGGCGGGTTTAATAACAGAAAGAGTTTCGGGGACGGCGGCGACCAACAAAATAAGAAATACGGAGCCCGCGGCTCGTGGGGCGAGCGCGCTAATCAAGACTTGAAGCACACACGCGGCAAGTCATTCAAACACGAGAAAACAAAGAAAAAACGCGGTTCCTACCGGGGTGGCGCCATCGACACCGGGGTCCATTCCATTAAGTTTGAGGATTGA

Protein sequence:

>DPOGS202460-PA
MSVLPENIKAEANSIIHQYLNSIDKSLASNFLKITKAKPRAKNLPSFVDILQQFKSQQKPNKKQAESSDDSSEDEKPKKPVAQINGNAQQKKKESSDSDSSEDEKPQSKPAPKSNNVAPKAAPKKAESSDDSDSSDDGSKKPVAQPVKPQANKAAPKVAKQESSDDSSDSEDEKPATKPTPVKPAPAKATPAKATPAKTPQKKAESSDSDSSDDEAKKPAPKATPAKKPVAKPSKAEESSDDSDSDDATKPPPKAATAKPTPNKPAVKATPKKQESSSDDSSEDEKPQQKAFPKLQAAAAKPVGKVNKKQSSSEDSDESSEDEKAKAAAKQPAKPAAKPTPAKKKQESSSDDSDDEPPKPAQKSPAPAKPAAKAKAESSDSDDSSDDEPKAKQAKVATPAKPAKEDSSDDSDEEPPKKVQKPGAKPAAAPKKAPSDSDDSSEEEQKKPPSTPKAKPKQESSDDDDSDDEAPTEVQKKENNQSENVKAGKKRKASENEAEPTPAKKPYSNFVKGTNCSSTPSDKGPESKPTPFRRVVTEKVEVDPRLKDNSFEAKEGEDDAEEDDELNKSGGGGGRGGMNRGGFRGRGGFNDRGGRGGFGGRGGRGGFNDRGGRGGFNDRGGRGGFRGRGGFNDRGGRGGRGGFDREGRGGGRWGDRGGRGGRGGRGDRHSWGGDRGGFNKGGFNNRKSFGDGGDQQNKKYGARGSWGERANQDLKHTRGKSFKHEKTKKKRGSYRGGAIDTGVHSIKFED-