Monarch geneset OGS2.0

DPOGS215442
Transcript: DPOGS215442-TA; 3258 bp
Protein: DPOGS215442-PA; 1085 aa
Genomic position: DPSCF300298 + 57568-83813
RNAseq coverage: 71x (Rank: top 66%)
Annotation
Heliconius: HMEL015843; 0.0; 62.94%
Bombyx: BGIBMGA004604-TA; 5e-104; 88.18%
Drosophila: mun-PE; 9e-67; 71.52%
EBI UniRef50: UniRef50_E0VAU6; 2e-99; 43.23%; Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VAU6_PEDHC
NCBI RefSeq: XP_002423240.1; 3e-100; 43.23%; hypothetical protein Phum_PHUM044820 [Pediculus humanus corporis]
NCBI nr blastp: gi|345486384; 3e-114; 43.47%; PREDICTED: hypothetical protein LOC100122716 [Nasonia vitripennis]
NCBI nr blastx: gi|157112918; 3e-140; 44.06%; hypothetical protein AaeL_AAEL000129 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain: [473-554] IPR016017; 1.3e-09; GDNF/GAS1
Orthology group: MCL11106; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215442-TA
ATGTTCGAGAGTAGTACACTATCCGAGCGGAATCTTCTCGCAGTTGCCTGCTCTACCGTGACCGTTACCAAGTGTCAAGCCGCTCTCCGAACTCTGCAAGCGTTTCCATTCTTTAAACCGACTTGCCTCTGTCGAGAACCGAACGTCGATCCAGAATGCAACTCCTTCCGCGACTTCCTATTCGATCATCCCTGCGTTTTCGTGATGAAAAAAGAGAAAGATCCCTATCCCGTGGAGACTCTGCCAACTTGCACCTATGCTTTAAGTGTTTGCCACAACGAGAAAGCTTGTTCCGTGCTCTTCGAGAGATTCAAGAACGCTTGTAAAGCCAGAGACGGGGAGTGTCGTATGGAGGAGAGGGAATCCTGCCGCGAGGCCTGGGCCGGTCTTCGGCTTTCTCCACTCTTCGGCTGCATCTGCCCCAACACCCACATGAAGAAGAGATGTGATAGGATTTTCGCTGTCGTGAACCACAACCCGTGCGTCGGCTGTAAACCTTTATGTTTCACGGACACTGAGCTTCTTTTTAGTGTACTCCTGACCTCACACACCGTACAAATGCCCGCAGTTACCTCAACTCTTGGCTTCACACCTTCGGCCTTCGGTGATGCGGCAGCCTGGGGTGAATGTCCACCGGCAGTCCGGGCGCTGTTCGAGCCGAAGCGGCCGGCTGCCCGGGCTCCACCCCTCACGCGCTATGCTGCTCCCGACGGTACGATCCTGCTGCTCACCGAGAACGGCACCAGGCTGATGACTCGTGCCGCTGCCGCCAGTCTCCTCCGCGCTGCCCTACTCTCCCCGCGCCCCCCTAGACCCCGCCACGTAAAGCTGATATCTGACGGACGGACGGCAAATCGAAACACCACGGCGGTGACGTCATGGATTGGTGGTATGTCCATAACGCAGGACGGCAACCTTATCACTCCGGACAAATTAATGACACACGTACACCGTTATGCAAGCCCCCACTACAGGCCCCATGGGCACCATTACCACCATTGGAATCATGAGCCAGGGGGAAGTGAAGATCCTCTATACGCTACCAAGACAGTCTACGGACCAGGCTACATCGATGAAAACCAGAGGATTGGCAAAGAAAACGGTAAAGCAGGTTATGAAAAGGTGGCCCTTCAATCGACCTGTCACGTTGCATTGGACGCATGCATAAAGGATACGGACTGCGTGACGTCACTCACGCCCGTGCTTCAAAAATGTCACTCAACCGACTGCGATAGGGAGGGATGTATGTCAGCGTTGAGGGGATTCTATAGGAAGCCTGGTGTACATTGGAACACGGAAATCGCATTCTGTCTCTGCAAGAAAACGGATAACAAGGAAGACTCGTGCATGAATGCCCAAGAGAGATTACACCCATCTTGCGCCCAACGACCAGCTGTCGGTTCACCGCTTCCCGCCTGTCATACCCTGGCCCACGCGTGTCGAGAGGAACCCGAATGCAGAATCCGCCTGGAAAACTACGAGCAATGGTGTGCCGTGGATGCCGTGACGCAAGGCTGTGCTGGATCTCCAGCAGCGTGCCGAGCGGCTGTTGTAGCTGTTCTGGGTACTCAGCTAAGAGCTGCCTGCGCCTGCAGAGGAACTGATTTTGCCCAGCTCTACGATTGCCTTGGATGGCAGAGATTGTTATGGCTCAATCCATGTGTTGTGGAATCTCAAAGCGATTACCACGTCAAAACCTACGGAGCCCTGCATACAACTACTCCATTCGACAACGGCGTCAGATACATGACAGTACCACCAGAGCCAGCAGTTCATCATCATACCACCTCACACCACCACAAGCATACAACCCAGCACGGCACGTCGCAAGTTGACGTTATGCAGGAGAATCGCCAAGAACAACCCCCGATCACAACTATGTCTGAACAAAAGATCCATAGTGACGCAGAACTGGAAGCAATCATTGTCACGACTATTGAAACCACAACAATGACCACCACAACGACGACGACTACCACGACGCCAACCACAACGACCAC
AACCACAACTACCACGACCACACCACGTCCGACCACTACCATGGAGCCAACGAAATATTGCATTGTTAAAAAGCCAGAGAGTGATGATCAGGCGCAATACATTAAAATGGGAGAAACCAGACGTCTGTATCGGTCTGCGGAGTTAGCGGAGTGCACAGACCTGTGCGTCTGTGAGGCCTCCTTGCAGGTGTCCTGTAAGGTCGCCTGTGTACCACGAGCCCCGTGTTCCTCCCGTCTAGCACACTACTCACACGCAGCACCCGCCTACCAAGCTTACCGCGGACGTTGCTACTGTTACTCTGGATCATTCATCTGTATGCGGCCTAATCCAGGTGAATACAAGTTGCCTGAGGGCGTCTACCTCTTGCTGGGATTCAGTGCCGTGGACGAGGCCTTACTGAGACCTCATACAGGACTCGGGGCCGAAGATGCCGTGAGGCTGCTTCAACGATATCTCAGGATCGCGCATCATGGAGAGACAAATTGTACCCTGACTCTGTTCAACATAAGCAATGAAAACGTTATCATCTCCGCTAGCCTACCACAAAAAGAACAAGAGGCTCTCAAGGAAGCCGGGGAAGCTCTACTAGATAGAGAGAAGGAAGCCTGCATAGATGTCTTGAAAGTAGTGAAAGCTCGCATTAACTCACAACACGAGGATATTTCGTCACACCTTCTCCTCTCCATCTTCAAGATAGCGGAAGTCGACGTCGTTTATCCAACGCCACCCAGCTCCGCGATATCCACCAACAGGCCAGATACCATCATTTACTTCATCATATGCCTCATCACTATCATGTACAACCTGCGTGACGTCACTTCCGCGATAACAATCACCACGACCGGAATCGTCAATTCCCTAACGAATTTATCTACAAAGACGTCCCCCTCATATCATATCGTTCCTTTTCAGATTTATATCATAGAATATATTTTATCTATTACGAAATGTGATGTATTTATTAATGTTGTAAATGACTCGCACCAAGCACTCGCTTTAGATGACGCACATACAAACACAAACGACGCTGAATTAAGTGTAGATATAAATTTATATAAAATAACAGAACATATCATAGCTGTGAATACAATATATGATAATTTATATGAAATTATTGCTAATGCAAACGTAACTAAATATTTTTCTAGTGTTGTGTATTACAAATTCGTATTTATTATGACGGAGATGCTTCTCGTGAGTTCTGTTATGGAAATATCGCGTCCTCTTCTTATGGGATTTGAATATCCGTTAGCTTAG

Protein sequence:

>DPOGS215442-PA
MFESSTLSERNLLAVACSTVTVTKCQAALRTLQAFPFFKPTCLCREPNVDPECNSFRDFLFDHPCVFVMKKEKDPYPVETLPTCTYALSVCHNEKACSVLFERFKNACKARDGECRMEERESCREAWAGLRLSPLFGCICPNTHMKKRCDRIFAVVNHNPCVGCKPLCFTDTELLFSVLLTSHTVQMPAVTSTLGFTPSAFGDAAAWGECPPAVRALFEPKRPAARAPPLTRYAAPDGTILLLTENGTRLMTRAAAASLLRAALLSPRPPRPRHVKLISDGRTANRNTTAVTSWIGGMSITQDGNLITPDKLMTHVHRYASPHYRPHGHHYHHWNHEPGGSEDPLYATKTVYGPGYIDENQRIGKENGKAGYEKVALQSTCHVALDACIKDTDCVTSLTPVLQKCHSTDCDREGCMSALRGFYRKPGVHWNTEIAFCLCKKTDNKEDSCMNAQERLHPSCAQRPAVGSPLPACHTLAHACREEPECRIRLENYEQWCAVDAVTQGCAGSPAACRAAVVAVLGTQLRAACACRGTDFAQLYDCLGWQRLLWLNPCVVESQSDYHVKTYGALHTTTPFDNGVRYMTVPPEPAVHHHTTSHHHKHTTQHGTSQVDVMQENRQEQPPITTMSEQKIHSDAELEAIIVTTIETTTMTTTTTTTTTTPTTTTTTTTTTTTPRPTTTMEPTKYCIVKKPESDDQAQYIKMGETRRLYRSAELAECTDLCVCEASLQVSCKVACVPRAPCSSRLAHYSHAAPAYQAYRGRCYCYSGSFICMRPNPGEYKLPEGVYLLLGFSAVDEALLRPHTGLGAEDAVRLLQRYLRIAHHGETNCTLTLFNISNENVIISASLPQKEQEALKEAGEALLDREKEACIDVLKVVKARINSQHEDISSHLLLSIFKIAEVDVVYPTPPSSAISTNRPDTIIYFIICLITIMYNLRDVTSAITITTTGIVNSLTNLSTKTSPSYHIVPFQIYIIEYILSITKCDVFINVVNDSHQALALDDAHTNTNDAELSVDINLYKITEHIIAVNTIYDNLYEIIANANVTKYFSSVVYYKFVFIMTEMLLVSSVMEISRPLLMGFEYPLA-