Monarch geneset OGS2.0

DPOGS201464
Transcript        DPOGS201464-TA  1506 bp
Protein           DPOGS201464-PA  501 aa
Genomic position  DPSCF300006 - 225043-228729
RNAseq coverage   21x (Rank: top 79%)
Annotation
Heliconius        HMEL015948        1e-164  64.11%
Bombyx            BGIBMGA002616-TA  1e-146  65.16%
Drosophila        CG5819-PB         4e-27   25.74%
EBI UniRef50      UniRef50_E3WSD9   2e-38   32.40%  Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WSD9_ANODA
NCBI RefSeq       XP_001662248.1    8e-38   32.88%  leucine-rich transmembrane protein [Aedes aegypti]
NCBI nr blastp    gi|312381723      6e-38   32.40%  hypothetical protein AND_05915 [Anopheles darlingi]
NCBI nr blastx    gi|312381723      2e-43   31.69%  hypothetical protein AND_05915 [Anopheles darlingi]
Group
KEGG pathway      npu:Npun_F1213  3e-18
                  K13730 (inlA) maps-> Bacterial invasion of epithelial cells
Orthology group   MCL44369  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201464-TA
ATGAAACGTGAAACATCAGTGTTAAGTTTTACATTCACATTATTATGGCTGCTGCCAGTGACGGTGAGCTCTGGGGTCCTGCGGGAGTTAGAGGACTACACCGATGACCCTTGCAGGGTCTATAATATGGATAATCTTATCCACCTGGATTGTTCCTATCGAGGACTTAGTGAATTACCAGATGGGTTAGATTATAACGCTCAAGTGTTAATTCTATCAAATAACAACTTTGTCGATTTTCCAGATGGACTGGAAAAGTTCAATAACGTACAAAGTTTGGATTTATCGGGAAATCTTTTGACAACACCACTGCCTAGATACTTTCAAAATTGGGCAGACCTTAAAATCCTCAATTTGTCTAACAACAATTACATTTCATGGTTAAGCAGTAATCATATCTTTACTATTGTTAGACTTGATTTATCTAAGAATAAGATAAACAATATCGAAGGAGATGCTTTTGCCAAAATGCCAAGACTAGTATTCCTCGAATTATCGGAGAACAGAATTGACAATCTCCCAAATGGAATATTTAGCAAGGCAACAAGATTAGAAATAGTGATACTGTCAAGAAATTATTTCTCCGAATTACCAAGATTTGAATCTGTATCTCTAAGAAATTTACATGTGAGTAGTTGTCAAATTACTACTCTCGATGTTAATGCATTAAGTGAGATGCGGTCTTTATTGGAATTGGATTTGTCAATAAATCAGCTTGAATCAATTCCGGATAATCTAGCCTCAAATTCATTGCAGGAGCTAGATTTAAGTTACAATGGAATTGATACTCTTACGGATCTAACTTTCTCTTCGCTGCCGCACTTAGCGGTTTTGGATTTGAGAGGTAACGAATTTAAAGAGGTCTGGTCTACTTCTCACTTCGCCTCAAATCCATTTTTGAGAACGGTTCATGTGAAAGGAAACAGATGGAGTTGCGAAGGCTTCAGCGTCAACCTCCTCTTGACGTATGAATTTTTGACGAAGGAACCGCCGAAAGTTTATGACATAGGTTCATTGATTTGTTATTCTCCATCAAACGTGACGCAAATGAGCTGGCAGCAGGCTTACATCCGAACATGGCATGCAGATGAACAATCACCATCGACTTACACTTTCATAGCTGTTATGATAGGAATGATCATGGGGGTCATTATAACCTCTTTTGTTTGTAGATTTGTAATGTCCTTTAACAAACCTGATCCACCTACATCTATCCCTGAAACTACAGCTTTAAATGGTACTACAAATCAACCGAGGTCTGAGTCCGTTGTAATGAGGGTGCAACTAAGGGAGGACCTACCACCAACTTATGACGAAGCTCTTCTATTACCCCGATTGAATGCCTCATTTCATTCTCTTCCTGATTTCATAGAGGAAGAAGAAGTAGCAGAAAATCGTGTACAAAGACAACGTAGATCTAGATCAATAGGCGATTTAACAGATACTAGACATAGGATCGATGATCGGCGATCAGTAAGAAGGGCTGTTGATATTAATTTTGATTAA

Protein sequence:

>DPOGS201464-PA
MKRETSVLSFTFTLLWLLPVTVSSGVLRELEDYTDDPCRVYNMDNLIHLDCSYRGLSELPDGLDYNAQVLILSNNNFVDFPDGLEKFNNVQSLDLSGNLLTTPLPRYFQNWADLKILNLSNNNYISWLSSNHIFTIVRLDLSKNKINNIEGDAFAKMPRLVFLELSENRIDNLPNGIFSKATRLEIVILSRNYFSELPRFESVSLRNLHVSSCQITTLDVNALSEMRSLLELDLSINQLESIPDNLASNSLQELDLSYNGIDTLTDLTFSSLPHLAVLDLRGNEFKEVWSTSHFASNPFLRTVHVKGNRWSCEGFSVNLLLTYEFLTKEPPKVYDIGSLICYSPSNVTQMSWQQAYIRTWHADEQSPSTYTFIAVMIGMIMGVIITSFVCRFVMSFNKPDPPTSIPETTALNGTTNQPRSESVVMRVQLREDLPPTYDEALLLPRLNASFHSLPDFIEEEEVAENRVQRQRRSRSIGDLTDTRHRIDDRRSVRRAVDINFD-