Monarch geneset OGS2.0

DPOGS204153
TranscriptDPOGS204153-TA2214 bp
ProteinDPOGS204153-PA737 aa
Genomic positionDPSCF300034 - 617600-619813
RNAseq coverage1388x (Rank: top 9%)
Annotation
HeliconiusHMEL0041270.091.16% 
BombyxBGIBMGA005035-TA0.089.55% 
DrosophilaNeurochondrin-PA0.057.39% 
EBI UniRef50UniRef50_E2AZQ60.060.55%Neurochondrin-like protein n=13 Tax=Endopterygota RepID=E2AZQ6_CAMFO
NCBI RefSeqXP_001850261.10.064.14%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3479718080.063.78%AGAP013338-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479718080.063.99%AGAP013338-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[1-737] IPR0087093.3e-242Neurochondrin
Orthology groupMCL12195 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204153-TA
ATGGGAGACATATCTGAGCCAATACAAAAATGTATACTGATACTTAAATCAGCAAAGACTGATACAGAGAAGTTTGCAGCTTTGTTTATGGTGACGAAACTAGTGAAAAGCAAAGATTGTAATTCTACAGCGAAGAAAGCTTTGTTTGAAGCAATAGGTTTCAAATTCTTAAAGAAGTTGTTAACTGGAACCAATGTGCAAGATGATTGTCCGCCGTCTGTTTATAAATCCGTCGCACTGTCTATTCTTACAAATTTCTGTAATGAACCTGAGCTTTCTTCACATCCTGAAATGCTTGCAAATATACCAGTTTTTCTAGATATAGTACAAACATCAGACAATGAAGATTATGATGATAACCTCATTATCATAAGTGAAGCATACACATGCTTACAATGCATTGCAGAACATGAAGCTGGTCAAAAGGCTCTAATAGAAGTTGGTGCCATTACTAAAATGTCTGAAATTTATTCTCACCAAAGTTTTCAAACTGACGAAGCTCTTAATATTCTCGTCAAATTAGTAAGTCGTTATGGCCCTGCTGCATGGGGTAATGATCCCAAGCCCTTTCACTCTTTAGTCAATAAAATTGCTCTTGACTTTGCAACTGATCAATCTGAAAGGAAATTTGAACTAGCTACCATACTCAGTGCTCTATTATATAGTTGTAATAAGTCAACGGTTGTACCAGGTTTTGCCGATGAAACCTGGCACTTGAGTATATATAAAGCCCTATATGATATTTTAACAAGTAAAATAGGTAAGAATCAAAGAGACCCAGCTTTGAAATTAGCAGCCAACATTATTGATTTACTTGGAGTTGAATGGACATTCAACGATGAAGAAAATCCAAAAAAATTCTTCCTGTTACTATTGCAACTGTGTGCTATTGAAGTAAGGATGCAGTTGGAAGATAGAAGCTTTAAACAAGCATTTGCAAATGCTGAACTCGTTACAGCCTGTTTCATTGTGTTAGAATCTTCAATTAACTATATGGGTACAGATCAACTAGATTTAGAACAAAAAGAGAAACAATCTGTGTACACCAGTCTCAAGGGAGCATTTAATGCTGTTGTATCTATTTTAACAAAAGTCTCAAATGACAAAAACAAAGATAAGCTTCCTGATGCTGAGAAAGTTTTCATCTGTGCTATGGTCAGAGTTTTAGTTGCATGGATTGCTCAAGAAACTACTGCAATGAGAGAACAGATTTATGCACTATTACCATATATTTTTACCCTTGCAAATGATTCCTTCCATGCTCATCGATCCAGAAAAATGGCTGAGAAGACCAAAACTGAAGGTGAGCCTATGGATATGGACATATCTTTATTGGGTCAAATAGATTTACTACGCTTAATGTTACCAGCTTTGTGCCATCTTGTTGTTGAAGATAAAGCTAGAGATATAGTACTTAACCTTAAAGAGGAAGATATTCTATATGAAGCTATGAATTTCCATTGGTCAATTGTTCATTATAAGAAACCACCCATCCCAAAGTCAGAAAGAGGAAAAGCAAGGACACAACCAGAACCTGAATTAGATCCAAAAGTTTTGGAAGACATGAAGGATTCAAGAGCTGCAATGGTCAGCCTTTGCAATATTTTTATGAACTTGACAGTTTTGGCTCCTAAAGTTGTTGAAAACAGTATGTTATTTAACACTTTACTTAAATTTATTTTTAACAATTTACCAGAATTAAAAAATATACCTGACAACCTTGTACTTCATGGCCATCTAGCTGTGCTAGGTTTACTTTTGTTAAAACAACAGGCTAGCAAAGTGAAAAAGAATGATTTTTCAATATGCAGATACATACAATCAACAATCCGATTCCTCTGGGATGCATACAATGTTGATGAGTCCAATGATCCTAATGAACTTGTTGTGTCAATGACCTATAAGGAACATTGGAATGAAATAGCTGATTTGTGGTTCTTGGGAATGCAAACTTTAAGTGGAGTCTTGCAGACAATCCCTTGGATCTCTGAATTTGCTATAGAGAGTGGATGGGCTCAAGGAATAGCCGAAATGCTCGTAAAAGTCAAAGTTGGTACTTTACCCCCCAATGTTAAGTCAGCATTTGAAGATTTTCTTTGCAGGCTTGTTGACTCGAATGAAGGTGCTATTCCTGTCTTGAAAAAGGGTGGTGCTTTGAAAATGTGCAGAAACCATAGATTAATGGATTTGGGCAAAAAACTCTTCGGAGATTAA

Protein sequence:

>DPOGS204153-PA
MGDISEPIQKCILILKSAKTDTEKFAALFMVTKLVKSKDCNSTAKKALFEAIGFKFLKKLLTGTNVQDDCPPSVYKSVALSILTNFCNEPELSSHPEMLANIPVFLDIVQTSDNEDYDDNLIIISEAYTCLQCIAEHEAGQKALIEVGAITKMSEIYSHQSFQTDEALNILVKLVSRYGPAAWGNDPKPFHSLVNKIALDFATDQSERKFELATILSALLYSCNKSTVVPGFADETWHLSIYKALYDILTSKIGKNQRDPALKLAANIIDLLGVEWTFNDEENPKKFFLLLLQLCAIEVRMQLEDRSFKQAFANAELVTACFIVLESSINYMGTDQLDLEQKEKQSVYTSLKGAFNAVVSILTKVSNDKNKDKLPDAEKVFICAMVRVLVAWIAQETTAMREQIYALLPYIFTLANDSFHAHRSRKMAEKTKTEGEPMDMDISLLGQIDLLRLMLPALCHLVVEDKARDIVLNLKEEDILYEAMNFHWSIVHYKKPPIPKSERGKARTQPEPELDPKVLEDMKDSRAAMVSLCNIFMNLTVLAPKVVENSMLFNTLLKFIFNNLPELKNIPDNLVLHGHLAVLGLLLLKQQASKVKKNDFSICRYIQSTIRFLWDAYNVDESNDPNELVVSMTYKEHWNEIADLWFLGMQTLSGVLQTIPWISEFAIESGWAQGIAEMLVKVKVGTLPPNVKSAFEDFLCRLVDSNEGAIPVLKKGGALKMCRNHRLMDLGKKLFGD-