Monarch geneset OGS2.0

DPOGS214902
Transcript | DPOGS214902-TA | 2958 bp
Protein | DPOGS214902-PA | 985 aa
Genomic position | DPSCF300135 - 235307-247922
RNAseq coverage | 41x (Rank: top 72%)
Annotation
Heliconius | HMEL004528 | 0.0 | 92.11%
Bombyx | BGIBMGA002173-TA | 0.0 | 91.25%
Drosophila | CG34127-PB | 0.0 | 46.77%
EBI UniRef50 | UniRef50_Q7PGX1 | 0.0 | 60.55% | AGAP003115-PA n=6 Tax=Endopterygota RepID=Q7PGX1_ANOGA
NCBI RefSeq | XP_971088.1 | 0.0 | 64.93% | PREDICTED: similar to CG34139 CG34139-PA [Tribolium castaneum]
NCBI nr blastp | gi|91082043 | 0.0 | 64.93% | PREDICTED: similar to CG34139 CG34139-PA [Tribolium castaneum]
NCBI nr blastx | gi|91082043 | 0.0 | 65.04% | PREDICTED: similar to CG34139 CG34139-PA [Tribolium castaneum]
Group
KEGG pathway | tca:659717 | 0.0 | K07378 (NLGN) maps-> Cell adhesion molecules (CAMs)
InterPro domain | [56-673] IPR002018 | 1.5e-123 | Carboxylesterase, type B
Orthology group | MCL10116 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214902-TA
ATGGCCATCCACAAACTGCCATTACATATTTGCAACTCCATAAAGCAGTACTGGTTAAAGATAGGTATTGTAAAAAAACTAGAAAAATCAAGTAAGACGAATGCAGTGAATCGAAAATGCAAACTGGAAACTCAGGAGAGATTTTCTTTAGTGATACTATTGGTGTTACTGTTAAGTGTAAATTCTAGTGGGAACAGTTTTAGTGGTGGTAGAAATTCTATGCTAAGAACTAGAATTATAGGTACGAGATACGGAAAATTACAAGGTGTTATACTACCAATGGATCAACATAAGTACTTAAAACCTGTGGAAGCTTATTTGGGTGTGCCATACGCTACACCACCTACAGGATCTAATAGGTTCGCTCCGACACGAGCCCCAGCTCCATGGGATGAAGTGAGGACAGTTGACCAAATGGGACCAGTTTGTCCTCAACGTCTGCCAGACATCACAAATGAGACTATCACTTTGGAGAGGATGCCAAAAGGCAGGCTTGAGTATTTGAGGAGATTGTTGCCGAGACTAAAAAATCAAAGTGAAGATTGTCTTTATATGAATATTTACACTCCAGTTCAAGTTGGTCCAACACTCCAAGCCAAATACCCTGTGGTGATATTCATTCACGGGGAGTCGTTTGAGTGGAACTCCGGGAATGTCTACGATGGCGCCGTTCTCGCCAGTTATGCGGGGCTTGTGGTCATCACTATTAATTACCGACTAGGCATATTGGGATTCCTTAATGCAAACCCTATACCACATTTGAAAGCGAGAGTAGCAAATTACGGACTGATGGATCAAATAGCGGCTTTACATTGGGTTCAACAGAATATCGCCCTATTTGGAGGTGATCCTGGAAATGTTACTATGTTGGGACACGGCTCAGGTGCTGCATGTATAAATTTTCTAATGATATCACCAACTGTTATGCCGGGACTTTTCCATCGAGCAATCTTACTATCTGGGTCAGCTTTGAGTTCTTGGGCACTTGTAGAAGATCCTGTTAGTTATTCTGTCCAACTCGCTAAGCAATCTAATTGTACTCTTCCCGAAGATATCGTCAAGGACCACGAATTGATCGTAGACTGCCTCAGAGAAGTACCTTTACAAGAGTTAATGTCTGCTGAAATTAGCACGCCGAGTTATCTCACAGCATTCGGACCTTCAGTTGACGGAGTTGTTGTAAAAACTGATTACGCGAAAGAATTGCTAACTTTCTTCATTCCAAATGACCTACAAGGGTTCACTAGTGTGTCCGGTGTCAACAATGTGAAAATGGACAAAAGAAGTGGAGATAGGATTTTCGGGATAAGAGGCGGGCAAAATAAATATGATCTGTTATTCGGAGTAGTAACGAGCGAAGCTCTTTGGAAATTTTCGGCTCAAGATATACAAAATGGATTTGAAGGAGAGAGACGAGACAGAATAATAAGAACTTACGTTAGAAATGCTTACACATATCATCTAAGTGAAATATTCTTTACTATTGTCAATGAGTACACTGATTGGGAGAGAACGGTTCAGCATCCAATCAATACTCGGGATGCGGCAGTGCTAGCGCTATCAGATGCCCAATATGTTGCTCCATTAGTTCAAACGGGAGATTTTCTGAGTGTTAGCAAAAGTTCTATCGGGTCTGGACCAAATACCTTCTTTTATGTATTCGATTATCAAACGAAGGACGGCGATTACCCGCAGCGTATGGGTTCTGTACATGGTGAGGAGCTGCCATATTTATTTGGAGCGCCATTAGTAGAAGGACTTGGACACTTTCCCAAAAACTACACAAAATCGGAAGTGGCACTATCTGAAGCTTTCATTCTCTACATAGGTAACTTCGTCAGGACTGGGAATCCCAATGAAGCCCAAAGACAGGAGGCTGTCCTACCGATATCAAGAGAAAGAAACAAATTTAAAAGCATTTTCTGGGATGAGTATGATACGTTGCATCAAAAATATTTAGAAATTGGAATGAAGCCTCGTATGAAAAATCACTATCGTGC
ACACCAACTGTCTGTTTGGCTTCGTCTCATTCCTGAGATACATCGCGCTGGAATGAAAGACGTTGTCGCTAAACACAACCTCTTCCGTAACCACAATGATCCTGAACTATACGACGGACTGGTCCGACCCGATCCACTAACTCGCTACAACTATTACGACCCCACTCTTGAACTCTACAGACGTCCCAATCTTACATTATTAGATATACCATCCACAATAGAAACTTACGTTACCACATGCGTCAGTGTCATGTCGCCCCGACCCGACTCGCTTGTCACTCAAAGTCAGACCAACATATCGCATCCGCAAGACGTATCCAATTTAGAGGTTGCTGGTTACACTGCGTATTCCACAGCTTTGAGTGTCACTATCGCGATTGGCTGCTCCCTGTTAATTCTAAACGTTTTGATATTTGCTGGCGTTTATTATCAGCGTGATAAAACAAGATTGCAAGTAAAGGCATTGCAGCAGCAAAAGAGAAATCAGAATTCGACATTTGACAGTGTATCGTCGAAACACCCTCACTATTTCGTTGGACATTCACAAAGTTCCAGCACAATAGTGGACATCGATCATCAAGACAAGAATGCTATTATCTCTATGACAAACCGTGTTCACCACTTTACAAATCAAAACTGTCCGAACGTTTGTCACACTGGGATACAAATGTCCAATTTGGCTCAAAAATCCAGTCCGACGAATAGGGGACAATGTACGACATTGCCAAGAAAAGTGGGTTTCAGTTATCAGAATCAAATTTGTAGCCCATCCAATTGTATGACTCTACCAAAAAATGCAACGTTCATGAGCAGTAGCAACTTGCCAGACGTACAAGCACAGACAGGGCAATCTCAAACATCAGGAAATGGCTCGGTCCCTTCCTCTTCACCACCTTCTCAACATTTCTCTCAAAAATCCCGAGTGCCGCAAGCCGCGATGTCCGAAATGAACGTATGA

Protein sequence:

>DPOGS214902-PA
MAIHKLPLHICNSIKQYWLKIGIVKKLEKSSKTNAVNRKCKLETQERFSLVILLVLLLSVNSSGNSFSGGRNSMLRTRIIGTRYGKLQGVILPMDQHKYLKPVEAYLGVPYATPPTGSNRFAPTRAPAPWDEVRTVDQMGPVCPQRLPDITNETITLERMPKGRLEYLRRLLPRLKNQSEDCLYMNIYTPVQVGPTLQAKYPVVIFIHGESFEWNSGNVYDGAVLASYAGLVVITINYRLGILGFLNANPIPHLKARVANYGLMDQIAALHWVQQNIALFGGDPGNVTMLGHGSGAACINFLMISPTVMPGLFHRAILLSGSALSSWALVEDPVSYSVQLAKQSNCTLPEDIVKDHELIVDCLREVPLQELMSAEISTPSYLTAFGPSVDGVVVKTDYAKELLTFFIPNDLQGFTSVSGVNNVKMDKRSGDRIFGIRGGQNKYDLLFGVVTSEALWKFSAQDIQNGFEGERRDRIIRTYVRNAYTYHLSEIFFTIVNEYTDWERTVQHPINTRDAAVLALSDAQYVAPLVQTGDFLSVSKSSIGSGPNTFFYVFDYQTKDGDYPQRMGSVHGEELPYLFGAPLVEGLGHFPKNYTKSEVALSEAFILYIGNFVRTGNPNEAQRQEAVLPISRERNKFKSIFWDEYDTLHQKYLEIGMKPRMKNHYRAHQLSVWLRLIPEIHRAGMKDVVAKHNLFRNHNDPELYDGLVRPDPLTRYNYYDPTLELYRRPNLTLLDIPSTIETYVTTCVSVMSPRPDSLVTQSQTNISHPQDVSNLEVAGYTAYSTALSVTIAIGCSLLILNVLIFAGVYYQRDKTRLQVKALQQQKRNQNSTFDSVSSKHPHYFVGHSQSSSTIVDIDHQDKNAIISMTNRVHHFTNQNCPNVCHTGIQMSNLAQKSSPTNRGQCTTLPRKVGFSYQNQICSPSNCMTLPKNATFMSSSNLPDVQAQTGQSQTSGNGSVPSSSPPSQHFSQKSRVPQAAMSEMNV-
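
The lengths reported for this record agree with a transcript made up entirely of coding sequence: 985 codons plus one stop codon gives 2958 bases. The Python sketch below runs that check and includes a small FASTA reader that joins wrapped sequence lines. It assumes the trailing "-" in the protein sequence marks the stop codon. The function names `read_fasta` and `cds_matches_protein` are illustrative and are not part of any geneset tooling.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {record id: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: the record id is the first token after '>'.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            # Sequence line: may be one of several wrapped lines.
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def cds_matches_protein(cds_len, protein_len):
    """True if cds_len bases encode protein_len residues plus one stop codon."""
    return cds_len == 3 * (protein_len + 1)


# Lengths taken from the record: 2958 bp transcript, 985 aa protein.
print(cds_matches_protein(2958, 985))

# Toy input (not the real record) showing wrapped lines being joined.
toy = ">demo-TA\nATGGCC\nTGA\n"
print(read_fasta(toy))
```

To apply the check to the record itself, pass the full text of both FASTA blocks to `read_fasta` and compare `len()` of the nucleotide sequence with `len()` of the protein sequence after stripping the trailing "-".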