Monarch geneset OGS2.0

DPOGS200895
Transcript: DPOGS200895-TA, 3492 bp
Protein: DPOGS200895-PA, 1163 aa
Genomic position: DPSCF300066 - 321201-342014
RNAseq coverage: 797x (Rank: top 16%)
Annotation
Heliconius: HMEL012734, 7e-97, 65.76%
Bombyx: BGIBMGA000546-TA, 3e-37, 56.82%
Drosophila: CG7546-PD, 3e-16, 45.95%
EBI UniRef50: UniRef50_E0VZR7, 2e-21, 63.16%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VZR7_PEDHC
NCBI RefSeq: XP_001650192.1, 2e-22, 58.67%, hypothetical protein AaeL_AAEL014998 [Aedes aegypti]
NCBI nr blastp: gi|157108363, 3e-21, 58.67%, hypothetical protein AaeL_AAEL014998 [Aedes aegypti]
NCBI nr blastx: gi|307173510, 2e-23, 22.86%, Large proline-rich protein BAT3 [Camponotus floridanus]
Group
Gene Ontology: GO:0005515, 2.9e-16, protein binding
KEGG pathway: cdu:CD36_24690, 7e-08
 K04523 (UBQLN, DSK2) maps-> Protein processing in endoplasmic reticulum
InterPro domain: [7-72] IPR000626, 2.9e-16, Ubiquitin
 [235-344] IPR021925, 3.4e-08, Protein of unknown function DUF3538
Orthology group: MCL23629 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200895-TA
ATGATTGAATTCACTATAAAAACGCTAGATTCCCGGGATCACCCGTTTTCCGTGGACGATGAGATTACAGTGGCACAGCTCAAAGAGAAAGTGCAGGAGCAGATGGGGATTGAAATTGGACTTCAGCGCCTCATCTTTTGTGGCAGAGTTCTTGCGGATGAAAAGAAACTAGCTGACTATGATGTCCATGGGAAGGTGATTCACATGGTGCAAAGGGCACCGCCATGCGTCGAAGAGCGGGAGACCTTGAGGGAGCGAGAGCGGGAGCGCGAGCGCGAGAGGGAACGTGAAAGGATGAACTCATTCACCAATCTAAATACGGATCCCATTAACTATGGAGCTGTTCATTTTAATCACATTACACAACAGCAGATAAGACGCCTTATGGCTTTGGCATCGACTGCCCATGGTATTGAGATCGAAGAGCCACCGGGCTCCGCCCTGTCTCCTACTGGGACGCGCTTGGACTTCCTCCGCCGTCTCATCATTGAAATACGATCAACCCTCGATGCTATCATACAAAATGAAAGTAATGAACCACGTAGTTTTTCAACTGAAGATCCATTGGAACCCAGAACAAGCCAGGGAGAATCTAGTTCGGTGCCAGATGAGCTCGATCAAGGCACCGGAGGTACCCGCGAGGGTCGCGGACGCAGGATTCGTCAGGCTCAGGCTGCTTACCACACGCCTCCTATTGAGTTCGGCCAGCTCGTAGCTGAGCTCCACGAGTTGCATAATGAGTTTACTCCCTTCAGGGAGGCATACATCATGACGCTAAATGAAGCCAGCGATTCGAATGTCCAGCTGACAGAGGACGTACTCCAACGTCGCCAGCGCACCGCTGATCTGGCCGCAGAGTTGTACCATAGCTTCTCTCACGCCTACCATGTCGTGAGCGATATTGGACTCATGTTGGCTCATCGCAACTCTCGTCTCATGTCGGAGGCTCTGATGCGCCACCCTTTGCCATTGCAGGCGCACATTAATGTTGTGCAAACACCCGCTAACCGTCGTCAGACAAACGCGTCTTCATCGACCGGCGCGGGTCCGTCCACCGAGAGCCCGCAACCCAGCAGTTCACAAGCCGGTAACCCGACCGTCAATATAGATATACAGCCAGATCCTATTACTTACCAAGTAGAAATAGAAACCAGGGTTCCGCTTGAAGCCACGGCTGAAAACCTGAACGATCAAATGCCGAGCCAGGAGGGTCAGGATCTGGGCGGTCGCCCACAATCTATGAACGATTTTGACAGTCTGTTTAGGGGATTGGGACAACCTGGCGGTATTAGGGGAGTTGAAGTACTTATGAGTATGGAAGAGATCACTCCGGTTAATGGTACTTTCACTGCTGCAATTCCAACTCTGAACTTGCAACCGGATGTGGGAGTTACCGGAGGTAACCAACCTCTGTACGGATCACAAATATATCTAGCTCAAATGCCGTGGGGTGCTGCTAATCAGGCAGCTCCGAGCGCAGATCTGTTGCAGAACATCGTGTCCTCAGTTATCAGACAGGGTCTCGTTGCTGGGATGGAGGGAGCTATGACCGCCCACGTGCAACAGGCCCATGTGCCAGGTCAAGGTCTCGGTGAAGGTCAGGTTCCAATGCAGGCGGACAACGCTCCGCCCCAACAACCCGACCCAAACCAGACACCAGCTCAAGAACAATCTCAAGAGAATCAGACCAATACAGAACAAAATACAAATCCTAGTACGCGACGCGTTCCAAGGCTGTTCACCCCTCGTCGTCAAGGAACGAACACAGCTCGCGGTCAGACGGTGTCCTTGAACAATTTGGTATACGACAGATTCCTTCAATGCGACAGTCATCACGCCCGTCGTCAGCTAACACGCCGCCGTGAGGAGACGTCGTTGGCCGGCGGACCTCTGCTTCGTGACGATAACAGTCAACGCGTGCAGAATAACGTGGAGACCTTGTACGAACGTTTCGACAGGAGCGCCATTAATGAAGAGTCTCTCATGATAGCTACTATGGT
CACTCTGCGTGAGGCCATATCGTTCACCGGGGGTCGAACTCTGGTCCCGGACGAATTGCAACCACTGCGCTATCGTCTCCAAGTGTACATGCGCGAACTCATGCAGGGCGAGTACGAGGTTGGCACGCAAAGCCACCTCGCTGATCTGATATTCGAGCGCCACGCCGAATTTATTAACCGCGTTACTGCTATAACGCCGACTCGTCCCAACGTGGATGTGACTGCTTCAATGAAGGCTGTGTTCCTACGTTTCCTGAATGAGGCTATGACGGTGCTGGATATTGAGAACATTGAAGTATTTTCTCGTCGCTTCCGGATCGTGTACCCGAGGCTTTTCTACGAACTATGCGGAGTCATCTCTTATTGTTGCTTGGAGGGTGTTGAGGGTCTTAAGAAGATATACCGCTCTTTCTTGACGGAATTGCTGCAGAATGTTGGAGAACCAGTACGTGATCTTCTCTATAGCCTGTCGATGGAGAACTTGAATGCTGCGATCTGCCGCATTGAACATAACAGGCTCCACTTCGCACAGTTCATACGTCGCAAGGAACAGCAGCCTTCCACATCGACCGCGATCGTAATGATGAATGAGCCGTGTACTACAATGGACGTGTCACCTCGGCCCGAGCCGGTGCCGATGTCTCCACGTGATGATAATGCTGATTCAGAGGAATCAGATGCACCAGTGGCCGGCTGCGACCGCAAGGAGGAATCCTCGTCGGACATGTCGTCCAAAGACGAATCTTCGTCTGATAATTCACCGAAGGGCGAGACGTCACGCGATCAGCACAGAAGGGACGAGTCCAGGAGACATCATTCATCGAGAGTCGATTCACTCTCGCCAGTTTTGTTTGGCGCAACACGGAAGCCGTGGACATTAAAGAATTCAAACAAGAAGACGCCTTCAAAATTCCCCAAGACGTCTACGCCGAGGGAGCAACTCGCTCAGCCGACGACACCGCTGCAGGGTGCACCTAATGTCACACTCGTCAGATATGGAGCACCCAGAGTTACGAGTGGTTTACGTCACCGCAAGGTGAATCGAGCTAACAAATCAGGCTCTGGTTCTAAGCCAGATGCCTCTGGATTATTCGTACCACCTGAGTCGATAGCGCAACATTGGGGCGAAGAATGGGTGCCAACTTTCACCCGTGATGTACAGGAGCAGGAACATCGTGATACCGCTGAGCCCTACAGTGATGCCTACCTTTCGGGCATGCCTCCGAAGAAACGTAGATGCGTGCGACAGTCGCGACCTCCTACGACACTGAACGCGTTCATCGCTGAGAGCGTGAACGAGGTATCGTCTCTGGGCAGCGTCCAGGGCGAGGAGCTGAGGGCAGCGTTTCGCGAGCACATGAGATGCATCGCCCGCGAGCGCGCTGCCGTCTCCGAGGATTACGAGCCGCGCCGGTTCGTCGCCACTGCACGCTTCCTCAACCAGACCAGGACGAGTACGCGGAAGTCGCCAGAACGCAGCAGCTCTAATTAA

Protein sequence:

>DPOGS200895-PA
MIEFTIKTLDSRDHPFSVDDEITVAQLKEKVQEQMGIEIGLQRLIFCGRVLADEKKLADYDVHGKVIHMVQRAPPCVEERETLRERERERERERERERMNSFTNLNTDPINYGAVHFNHITQQQIRRLMALASTAHGIEIEEPPGSALSPTGTRLDFLRRLIIEIRSTLDAIIQNESNEPRSFSTEDPLEPRTSQGESSSVPDELDQGTGGTREGRGRRIRQAQAAYHTPPIEFGQLVAELHELHNEFTPFREAYIMTLNEASDSNVQLTEDVLQRRQRTADLAAELYHSFSHAYHVVSDIGLMLAHRNSRLMSEALMRHPLPLQAHINVVQTPANRRQTNASSSTGAGPSTESPQPSSSQAGNPTVNIDIQPDPITYQVEIETRVPLEATAENLNDQMPSQEGQDLGGRPQSMNDFDSLFRGLGQPGGIRGVEVLMSMEEITPVNGTFTAAIPTLNLQPDVGVTGGNQPLYGSQIYLAQMPWGAANQAAPSADLLQNIVSSVIRQGLVAGMEGAMTAHVQQAHVPGQGLGEGQVPMQADNAPPQQPDPNQTPAQEQSQENQTNTEQNTNPSTRRVPRLFTPRRQGTNTARGQTVSLNNLVYDRFLQCDSHHARRQLTRRREETSLAGGPLLRDDNSQRVQNNVETLYERFDRSAINEESLMIATMVTLREAISFTGGRTLVPDELQPLRYRLQVYMRELMQGEYEVGTQSHLADLIFERHAEFINRVTAITPTRPNVDVTASMKAVFLRFLNEAMTVLDIENIEVFSRRFRIVYPRLFYELCGVISYCCLEGVEGLKKIYRSFLTELLQNVGEPVRDLLYSLSMENLNAAICRIEHNRLHFAQFIRRKEQQPSTSTAIVMMNEPCTTMDVSPRPEPVPMSPRDDNADSEESDAPVAGCDRKEESSSDMSSKDESSSDNSPKGETSRDQHRRDESRRHHSSRVDSLSPVLFGATRKPWTLKNSNKKTPSKFPKTSTPREQLAQPTTPLQGAPNVTLVRYGAPRVTSGLRHRKVNRANKSGSGSKPDASGLFVPPESIAQHWGEEWVPTFTRDVQEQEHRDTAEPYSDAYLSGMPPKKRRCVRQSRPPTTLNAFIAESVNEVSSLGSVQGEELRAAFREHMRCIARERAAVSEDYEPRRFVATARFLNQTRTSTRKSPERSSSN-
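
Consistency check (illustrative sketch, not part of the original record): the listed lengths agree with each other, since 1163 codons plus one stop codon give 3492 bp. The Python below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this example. It translates the first and last few codons of DPOGS200895-TA and compares them with the ends of DPOGS200895-PA. The stop codon is written here as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with amino acids
# listed in T, C, A, G order for the first, second, and third codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Lengths as listed in the record: 1163 aa plus one stop codon, 3 nt each.
transcript_bp = 3492
protein_aa = 1163
lengths_agree = (protein_aa + 1) * 3 == transcript_bp

# First 30 nt and last 21 nt of DPOGS200895-TA, copied from the record.
start_nt = "ATGATTGAATTCACTATAAAAACGCTAGAT"
end_nt = "GAACGCAGCAGCTCTAATTAA"

print(lengths_agree)        # True
print(translate(start_nt))  # MIEFTIKTLD
print(translate(end_nt))    # ERSSSN*
```

To check the whole entry, the full nucleotide sequence can be passed to `translate` after its two lines are joined into one string.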