Monarch geneset OGS2.0

DPOGS207304
Transcript: DPOGS207304-TA, 1539 bp
Protein: DPOGS207304-PA, 512 aa
Genomic position: DPSCF300008 + 898952-905812
RNAseq coverage: 347x (Rank: top 34%)
Annotation
Heliconius: HMEL007366, 3e-75, 64.43%
Bombyx: BGIBMGA012034-TA, 4e-111, 68.66%
Drosophila: CG2685-PA, 4e-62, 50.62%
EBI UniRef50: UniRef50_E0VSB4, 2e-82, 46.14%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VSB4_PEDHC
NCBI RefSeq: XP_002429008.1, 4e-83, 46.14%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|340716799, 2e-83, 54.03%, PREDICTED: hypothetical protein LOC100643531 [Bombus terrestris]
NCBI nr blastx: gi|307205523, 1e-133, 50.00%, WW domain-binding protein 11 [Harpegnathos saltator]
Group
Gene Ontology: GO:0006396, 9.4e-24, RNA processing
KEGG pathway: phu:Phum_PHUM414900, 1e-82
 K12866 (WBP11, NPWBP) maps-> Spliceosome
InterPro domain: [12-94] IPR019007, 9.4e-24, WW domain binding protein 11
Orthology group: MCL17026, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207304-TA
ATGGGGCGACGTTCAATAAATACCACCAAGAGTGGTAAATATATGAACCCTACTGACCAAGCACGAAAAGAAGCTAGAAAAAAGGAATTAAAGAAGAATAAAAGACAAAGACAAATGGTCCGGGCTGCAGTGCTTAAAATGAAAGATCCAACTCAAATACTTGAAGAACTGCAGAAAATTGATGAAATGGAGTACAATGTCCTTCAGCCGTCACCTCTTAATGAGAAAGTACTTAAAGAGAAAAGAAAAAAGCTAAAGGAAACCTTTGACAGGGTTCTGAAGATGTATGATAAAGATGATCCAGAAAAATGGGTGGAGCTAAAGAAACAGGAGATGGAATATGAAACGCGCAGGGCTCATTTAATATCATACTATGAATCTGTGAAACATGCACAATCTGTGCAAGTTGATGACATCCCTTTACCCACACTGCAGGTTCCTGACAATCTTATTTATGGGAATGTGCCCTCACAGATACCACTGCCCATAGATTCAATACATCCAAATTTACCAGTTCGACCTATACTGAAAAAGGACTCTGTATATAGAGAGAGGCCGTCAGGTGTTGAGCCTCCTGGAGTGCCTCCCGGCCCGCCACCGGATTTGGAGTCGTTACTCGGCCTTGAAAGCAACAAGAAGGACAAACACAAGGAAAAGAAGAGCATCCGATTCTCAGATCTACCAGACGAACCTCCGGGAGAACATAGATGTCAGAAAGTATCGAATCATGAGGACACTGAAGATGCAGGGGATTCAGAAAAGCAGTCTCAAGCTCCGAAACCGACTTCTCTACAGCAAAGAATGTTGATGCTGTCGGGACAGAACATAGACGACTTCATGAAAGAGATGGAAGAAGTGCATCGCAAGAGAGAGAGAGACAGAGCTGCGGATTTGAGTGCCAGACTGACTGCCCTGAAACGTTCCGGTGGAAACCGTGACAGTGATTCTGACAGCGACAACGAGACCCACGGTCATCACCACCACCACGACTTCACGCCTGAACCACCGGGGGCTGAACTACGACAGATACCGCCGCTCATACCACCAGGGCTCCGCCCTCCGATGCTTCGTCCGCCGGGAGTTCCCCCTGGGGCGCCGCCCGGCCCCCCTCCCGGATCACACCCTGCACCACATTCTGGAGGTCTGATGCTGCCCCCGGGTCCCCCGCCGGGTCTGCCTCGCGCCGCCCTCCGTCCGCCGGGCCCGCCGCCGGGTGCTCCGCCCCGATTGAGGCCACCTCACGCTCCTCTTACAACCCACGGCCCCCTCGCGCCGCATGTTTCAGTGCTCTCCGCTGCGCCACAGCTCATCACCAAGAAAGACGCCGTTCAAGGTGCGACGATCAGTGCGAAGCCTCAGATAAGGAATTTATCGGCGGACGTGACCCGCTTCGTTCCGTCGGCTTTGCGCGTTAAGAGAGATGATAAGAAATGTAAGCCGGACATCAGGTCGACCTTACACCGCCCGCCGGAGACGAGGCAGCCGAGCAAGGACGACGCCTACATGCAGTTCATGAAGGAAATGGAGGGATTGTTATAG

Protein sequence:

>DPOGS207304-PA
MGRRSINTTKSGKYMNPTDQARKEARKKELKKNKRQRQMVRAAVLKMKDPTQILEELQKIDEMEYNVLQPSPLNEKVLKEKRKKLKETFDRVLKMYDKDDPEKWVELKKQEMEYETRRAHLISYYESVKHAQSVQVDDIPLPTLQVPDNLIYGNVPSQIPLPIDSIHPNLPVRPILKKDSVYRERPSGVEPPGVPPGPPPDLESLLGLESNKKDKHKEKKSIRFSDLPDEPPGEHRCQKVSNHEDTEDAGDSEKQSQAPKPTSLQQRMLMLSGQNIDDFMKEMEEVHRKRERDRAADLSARLTALKRSGGNRDSDSDSDNETHGHHHHHDFTPEPPGAELRQIPPLIPPGLRPPMLRPPGVPPGAPPGPPPGSHPAPHSGGLMLPPGPPPGLPRAALRPPGPPPGAPPRLRPPHAPLTTHGPLAPHVSVLSAAPQLITKKDAVQGATISAKPQIRNLSADVTRFVPSALRVKRDDKKCKPDIRSTLHRPPETRQPSKDDAYMQFMKEMEGLL-
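
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein sequence marks the stop codon. The function name translate and its stop_symbol parameter are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard code; it does not state this.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed to be "-" here,
    matching the last character of the protein sequence in the record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons of DPOGS207304-TA, copied from the record.
print(translate("ATGGGGCGACGTTCAATAAATACC"))  # MGRRSINT
# Last four codons of DPOGS207304-TA, ending in the TAG stop codon.
print(translate("GGATTGTTATAG"))  # GLL-
# Length check: 512 residues plus one stop codon account for 1539 bp.
print(3 * (512 + 1) == 1539)  # True
```

Passing the full DPOGS207304-TA sequence to translate should, under the same assumptions, reproduce the DPOGS207304-PA sequence including the final "-".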