Monarch geneset OGS2.0

DPOGS201486
Transcript: DPOGS201486-TA (2148 bp)
Protein: DPOGS201486-PA (715 aa)
Genomic position: DPSCF300006 + 216572-219074
RNAseq coverage: 1181x (Rank: top 11%)
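The lengths above can be cross-checked with simple arithmetic: a 2148 bp transcript holds 716 codons, which is 715 amino acids plus one stop codon. The sketch below assumes the transcript is entirely coding sequence and that the genomic coordinates are 1-based and inclusive; neither is stated in this record, and the function names are illustrative only.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2148
PROTEIN_AA = 715
GENOMIC_START, GENOMIC_END = 216572, 219074


def codons_in(cds_bp: int) -> int:
    """Number of codons in a coding sequence (assumption: no UTRs included)."""
    if cds_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return cds_bp // 3


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


codons = codons_in(TRANSCRIPT_BP)                # 716 codons
residues = codons - 1                            # drop the stop codon
span = genomic_span(GENOMIC_START, GENOMIC_END)  # 2503 bp on the scaffold
non_transcript_bp = span - TRANSCRIPT_BP         # 355 bp not in the transcript

print(codons, residues, span, non_transcript_bp)
```

Under these assumptions the residue count matches the stated 715 aa, and the 355 bp difference between the genomic span and the transcript would correspond to intronic sequence.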
Annotation
Heliconius: HMEL015947, E-value 0.0, identity 72.65%
Bombyx: BGIBMGA002675-TA, E-value 0.0, identity 68.31%
Drosophila: wdp-PC, E-value 2e-45, identity 26.82%
EBI UniRef50: UniRef50_Q9W266, E-value 4e-43, identity 26.82%, Transmembrane protein windpipe n=14 Tax=Drosophila RepID=Q9W266_DROME
NCBI RefSeq: XP_001958917.1, E-value 7e-50, identity 27.87%, GF12319 [Drosophila ananassae]
NCBI nr blastp: gi|194753225, E-value 1e-48, identity 27.87%, GF12319 [Drosophila ananassae]
NCBI nr blastx: gi|194753225, E-value 4e-53, identity 28.51%, GF12319 [Drosophila ananassae]
Group
KEGG pathway: npu:Npun_F1213, E-value 6e-07
 K13730 (inlA) maps to: Bacterial invasion of epithelial cells
Orthology group: MCL21894 (Insect specific)

Nucleotide sequence:

>DPOGS201486-TA
ATGTGGCTGCAAATAGAAAATGTTTTCGAATTGTACACACCCACGTGGCGCTATACAAACAAGATGTGGTGGTTTTCTTTCTTCTTCTTTGCTGCGACACTTCTGCTAGAGCCCACACTGTCGGCAGTTTGTCCCGATAGCTGCGTTTGCAGTACCACCAGGGATGGCCTGCACCGAGTCACATGTAGCAATCTCGGTGACCTTTACAAGTACACACTTCGTCAAAAACATCACAATATAAATATTCTTGATCTATCTCATAACAACATTAGTAAAATTACTCACGAATTGGACCGACTCACCGAAGTAGTAACTCTTGATCTATCAACAAATGGATTAACAGAAGTAAACAAATTTTTGCACAATGCAAAAAAGTTAGTCCATCTCAATCTTGCTCACAATAGGATACAAAGACTCTCCTTGTCACATCTACCTACAAGTGTTAGTTCCTTAGACTTAACTAACAATCTGTTAAAAGATGTGCCTTCTGAGCTTGGACATTTACCTAGTTTGGAACATTTAGAACTAGAAGGTAATCCACTGGATTGTTCTTGTGCCAACATTATTGCACGTAATCAATTATTAGCTGCAAACACTTATATCGATGCTGTAAAATGTGAAAGTCCTATGAACTTAAAAGGTCGATCGTGGCTTGAACTAAAAACAAAAGAAGTGTGCAAAGTTATAAAGCCAGATTATTTAGATATGATGATGGGCGATCAACCTATAGATGCCATAAAAGTTGGTGAAGAAACCACTGCCTTAAAACCAAATCAACCTTTTGCTGCTAGAAGTGATCTGGATGAAGGCAAAATCATTCAAGGTGCCGAATTAATGGAAGATGATGATAGTTTACTATTTATGAAAGTAGGCCGAATTACGTCACCCTCACCTCTAAGTGAAATCGAAGGGTCAGGTGAAGCTGAAAGTACCACTATTAGCGATTTTATGGAACCAATACAAGAGCGTAAAATGCATGAAAATCTAATGAAAAACGAGGAAATTATCCCTGCGGATAATATTCATACTACGGAGGCACCCACGGAGGGTTCTGGTGAAGGTTCAGGATACTTTTTCCTCGACGATTACGAAGATACGACCGAACAAATGATTGAAACCACTACTCCTATAATTGAAGAATTAAGCCCAGATCCAGTTCATAGGATATTTAACGATGACTTAGACACAAATAATTCCGAATTTCCTGTACCCGAACCACCAAACGTATATGCTGGAGGTCCAGATTGGCATATCAAAACGGATCCTGAAATTGTAACAAAAAAAGCTGTAACCGTAGAAGTTACAAGAGATACAACAGTTTCAGAACAAATAAGGGTCACACAAGCCTCTGAGGTCCTGGGCCCTGCCAGCGTCGGTGAAGAAAATGAGGTTACACATAAAACAGGAACATATGTTTGTATTGCTTTAATAATTGTTTTACTAGTTGGATTAATTGGGTTTGCAATAGCTAAAGGCCAAATGCGAAAAAGAAGGGACCGAAGACTACTAAGACAACAAAAACGAGATGTAGAAAAGGCATCAAAAGAAATGGTCGATATGAACAAATCCTTATTGGGTAAACCTGCTCCCGTAGAAAATCCTTCAGACAAAAAAGTTAATGGAAAATATGAGCTTGTACCGACACATGAAAGTTATCAAAAAAAGAATGAAAATGGTGATATGGCAAATGGAATTAAACATGATGAAAATCAAGATATTCGAACTGATAGTCCTCGGGATACAAATCAAAATAAAAGTAGTTCTATAGACAATAATTTATCCAATCAAAATGATATACCACAACAAGAAACATCATTTGAATCTAACAACTCAAGGAAAGATACAAATTCTTTGTCTAGTGAAGATATATTTGTTCCTATAAATGATGATGACAATCCAAGATTGAATGGCAATTTGGATCTGTCTCAGCCTCTCATAAATGGAGATCCAGTCGCAGATTCAGATTATTTATCACCATCTCGTGAGTATGTCCCCGTATA
CTCACCTGATATGGGAAGAGTTCGGATTAAAATGACAGAAGTTCCAAAACCGAAAACACCCGTTCTTGTCACGCGAAGTAGGTCAAACGCTGGAGACATAATTATAACACCATCATTGAGCGGAAATGCAACTCAAACGACTACTTGA

Protein sequence:

>DPOGS201486-PA
MWLQIENVFELYTPTWRYTNKMWWFSFFFFAATLLLEPTLSAVCPDSCVCSTTRDGLHRVTCSNLGDLYKYTLRQKHHNINILDLSHNNISKITHELDRLTEVVTLDLSTNGLTEVNKFLHNAKKLVHLNLAHNRIQRLSLSHLPTSVSSLDLTNNLLKDVPSELGHLPSLEHLELEGNPLDCSCANIIARNQLLAANTYIDAVKCESPMNLKGRSWLELKTKEVCKVIKPDYLDMMMGDQPIDAIKVGEETTALKPNQPFAARSDLDEGKIIQGAELMEDDDSLLFMKVGRITSPSPLSEIEGSGEAESTTISDFMEPIQERKMHENLMKNEEIIPADNIHTTEAPTEGSGEGSGYFFLDDYEDTTEQMIETTTPIIEELSPDPVHRIFNDDLDTNNSEFPVPEPPNVYAGGPDWHIKTDPEIVTKKAVTVEVTRDTTVSEQIRVTQASEVLGPASVGEENEVTHKTGTYVCIALIIVLLVGLIGFAIAKGQMRKRRDRRLLRQQKRDVEKASKEMVDMNKSLLGKPAPVENPSDKKVNGKYELVPTHESYQKKNENGDMANGIKHDENQDIRTDSPRDTNQNKSSSIDNNLSNQNDIPQQETSFESNNSRKDTNSLSSEDIFVPINDDDNPRLNGNLDLSQPLINGDPVADSDYLSPSREYVPVYSPDMGRVRIKMTEVPKPKTPVLVTRSRSNAGDIIITPSLSGNATQTTT-
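The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard nuclear genetic code and renders the stop codon as "-" to match the convention used in this record; the `translate` helper is illustrative and not part of any database tooling. Only the first 18 and last 15 nucleotides of DPOGS201486-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (frame 1, forward strand)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last five codons of the transcript in this record.
first = translate("ATGTGGCTGCAAATAGAA")
last = translate("CAAACGACTACTTGA")
print(first, last)
```

The two fragments translate to the start (MWLQIE) and end (QTTT-) of DPOGS201486-PA. Applying the same function to the full 2148 bp sequence, with line breaks removed, should reproduce the whole protein under the stated assumptions.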