Monarch geneset OGS2.0

DPOGS210419
TranscriptDPOGS210419-TA2865 bp
ProteinDPOGS210419-PA954 aa
Genomic positionDPSCF300062 - 504277-508323
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0077540.081.00% 
BombyxBGIBMGA002751-TA0.074.30% 
DrosophilaCG13708-PA8e-6555.65% 
EBI UniRef50UniRef50_UPI000206371B1e-17037.61%UPI000206371B related cluster n=2 Tax=unknown RepID=UPI000206371B
NCBI RefSeqXP_969344.21e-16643.23%PREDICTED: similar to CG13708 CG13708-PA [Tribolium castaneum]
NCBI nr blastpgi|3287817925e-17037.61%PREDICTED: leucine-rich repeat-containing protein 49-like [Apis mellifera]
NCBI nr blastxgi|3071851901e-16438.16%Leucine-rich repeat-containing protein 49 [Camponotus floridanus]
Group
KEGG pathwaypgi:PG18648e-17 
 K13730 (inlA)maps-> Bacterial invasion of epithelial cells
Orthology groupMCL14442 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210419-TA
ATGCCTATAAAGTATAACCGTGGAAATGTAAGAAAAATTGGTTTTCGTACCAGAGGTATATTAGCACAATCCCTTGATGTCAGCGCTCCACGTGTGGACTCTGGAGCAGGAGGAGATGGTAGGTTATTTCTACAAATTCGACCTGCTCTGGCTGATCCACGTCATGCTGTATTACAAAGATCTAATTCAACTTTGTCGGGTAACTCATATTTATGCAAAGAAGATCCTAAAGACGACAGCAATGGACTACAGGCTGTTGGAGAAGGCAAAGTACAATTATCACGCACCCCACAGGAAAAAGACCGTCTCCCGGATAGGATTAGTCTTGATAGACGGGGTCTCTCATCAATACCTCATATCGTGGGCGAGCCAGGCTTGCGACTGCTCTCCCTACAACATAATTTGATTAATACCCTTTCTGGTCTGTCACCGCTTGATTTAAGTAAATTAGTTTTTCTAGATGTCTACGACAACCAGATCGATAAAATATCATCGCTTGATAGACTCTTCAGCTTAAGAGTACTACTTATGGGAAAGAATAGAATAAAAAGAATTGAAGGATTATCCAACCTTATAAAATTGGAGGTTTTGGATTTACACGGTAATCGGATAATTAAAGTTGGCGGATTATCGAATCAGAGCGAATTAAAGGTGCTCAATTTAGCGGGTAATCAGATCAAAAGTATGGCACCATCAGATCTTCAAGGCCTGATCTCGTTGAGAGAGTTAAACTTAAAACGTAACCGCTTAAGGAAACTACTCGGTTTTCAAAACACTCTAAAGCTACAGAAGTTGTATCTTGGAAATAATGATTTGCAAAGTATCGAAGACGTCGCTTCATTGGCTGAAGCAACGTCGCTAGTGGATGTATCTCTGGACGGAAATCCGGTAGCATTGGGCGGAGACTGCACGCCGTTTCTAGTTTCCTATCTCCCCAATCTAGTTACGTTGACGAACATGCATGTTAGTGAGCAGGTACGACAAGCAGCTATGGCCTGGCGCAGTAACAAGGAAGCAGCTCACGCAGCGTACTGCGCTCTCAGTGGCTCCGCGCAACAAGCCGCTAGGAGAGACCAAATTATACATAACGCAAGAACTAATTGGGAACTACTTAGATCAGAAAATAAGTGTTTTATTTCCGCCACAACGCCAACAAAACAAAATGAAGATGAAAAACAAATTGAGCCCCGATCGTCTAAAGGTACAAAGAATGATGCAACACAAACAGTGGAGACAGTCTGCACTCCGGATCTAATAGCTTCAACTCAACACCTCGAGGTTCAAGAACCCAACAAAGGCAATAGAAGTAATAGCGAAATAAAGACAAAAACTATTCATGATAGTACAAATTCAAAATCAGTGCCCACTAAGAAACTCCAACGAAGTTCAACTGCCCGGAAGCCATCTGAAAGACGAGTGAACTTCTCAGAAAGGAGCGCCTCCCAAGAAACGGACGCCTCGCACTCAACATCCACCAGCAGCGATTTGCGTCTCCCTCCGATCTTACTGCCAATAATATCATCCTTGGAAAATGTTAAGCTAACAGATAGTTCCGAACCTATACTCAAAAGGTGGGAGAGTATATCTAGCGTAGAACCAGTTGGAGATTCGTCGTTTAGCTCCCTACAATCATCAACCAGTGACAGCGACGAGGAGACCATTAAGAGACAATTCCGTAGAGTCCCAACAGTATTGAAGAGGCGTGAACATTTTAGCACGGTGCGATCAAAATCAGTCTGCGATCCAGAAAGCAGAAGAGTTAAGACAAGCAAAAATGTTGACAGTGAAAACGCTAGTAATATATCTTCTGGTACGAATTTCTTTTCTGTTGGCACATCCTCAACATCTGGAAGTGATAACAATAGTAAGACTCTTAAGCGGCAAGGCTCATTAAATGGTAGACCGAACAGAAACATCCGCAGCGCCACAATAACGAGAAGAAGTGAGCGCGCTTCCTCAGCTCACAGAGCATCCACGGCCAGAGCCAAGTCAACGAAAGCTATAGCCAACATGAAGTACTCGGAACCAATAAAGCCAGTACCACCGAAGGACCGAGAACAAGGAGTCGATTATCTGATAGAAGTCAGTGAAGGCGTTGTCAGTGCGTGGGGCGCGGGAGCGGTCAGACGACTAGCTAGAGACTGGGAATGGGAGAAAGCGAAGACTGTCAATCATGCAGCTTTCCATTACGTACATTTTAATGCTGTTGCGCAATGCCTACCGGAACTGAAAGCCAAGTTTCCAAACGTCACTTCCTTATCTGTGCGGGCGACCGGTTTACAGAATTTGGGCCAATTACACGCTCTGGCTGAACTACGGGGACTGACATCGTTAACCGTTATGCCGGAGGGTAACCCTATATGTGTCAAAACATGGCGGGAATACTCTATATATAGACTTGCCCATTGGGGCCTAAAAGAGATTAATAGTGAAACGGTGACGGATGAGGAAATCAAATCTGCCAATGCAACGTATGCTGGCCTCAGTGATGTTGTACTCCGTGCGCTTCCAGACGCTCCTTTACAACCATTACTATCAAGATTAGGTAAAAGTAGGAACAGTACGATCAGCGCCAAGGCCTGGCTGAGGGCTGCGGATCCCGCATTAAGAGACGTTATCGCCAAAGAGGCTTTGCAATATAAGAAAAGTCACGTGTCACAGGAGGATATGACTTGGCGGGTGCGCGGTCGCGGTCAGTTATCGCACGCTATAGATCTAGCGTGTGGGGCCGCCATTAGACTAAGAACACTCGAATTACAATGGCCGACGATTTTCGTTGAAATGATCGAGGAAGTGTTACAAGATTTTTCTGACATGGAAAATCATGTTAAAGAACAAATGCGTATGCTTATGGATACATTATAA

Protein sequence:

>DPOGS210419-PA
MPIKYNRGNVRKIGFRTRGILAQSLDVSAPRVDSGAGGDGRLFLQIRPALADPRHAVLQRSNSTLSGNSYLCKEDPKDDSNGLQAVGEGKVQLSRTPQEKDRLPDRISLDRRGLSSIPHIVGEPGLRLLSLQHNLINTLSGLSPLDLSKLVFLDVYDNQIDKISSLDRLFSLRVLLMGKNRIKRIEGLSNLIKLEVLDLHGNRIIKVGGLSNQSELKVLNLAGNQIKSMAPSDLQGLISLRELNLKRNRLRKLLGFQNTLKLQKLYLGNNDLQSIEDVASLAEATSLVDVSLDGNPVALGGDCTPFLVSYLPNLVTLTNMHVSEQVRQAAMAWRSNKEAAHAAYCALSGSAQQAARRDQIIHNARTNWELLRSENKCFISATTPTKQNEDEKQIEPRSSKGTKNDATQTVETVCTPDLIASTQHLEVQEPNKGNRSNSEIKTKTIHDSTNSKSVPTKKLQRSSTARKPSERRVNFSERSASQETDASHSTSTSSDLRLPPILLPIISSLENVKLTDSSEPILKRWESISSVEPVGDSSFSSLQSSTSDSDEETIKRQFRRVPTVLKRREHFSTVRSKSVCDPESRRVKTSKNVDSENASNISSGTNFFSVGTSSTSGSDNNSKTLKRQGSLNGRPNRNIRSATITRRSERASSAHRASTARAKSTKAIANMKYSEPIKPVPPKDREQGVDYLIEVSEGVVSAWGAGAVRRLARDWEWEKAKTVNHAAFHYVHFNAVAQCLPELKAKFPNVTSLSVRATGLQNLGQLHALAELRGLTSLTVMPEGNPICVKTWREYSIYRLAHWGLKEINSETVTDEEIKSANATYAGLSDVVLRALPDAPLQPLLSRLGKSRNSTISAKAWLRAADPALRDVIAKEALQYKKSHVSQEDMTWRVRGRGQLSHAIDLACGAAIRLRTLELQWPTIFVEMIEEVLQDFSDMENHVKEQMRMLMDTL-