Monarch geneset OGS2.0

DPOGS201376
Transcript: DPOGS201376-TA (4803 bp)
Protein: DPOGS201376-PA (1600 aa)
Genomic position: DPSCF300083 - 3676-20259
RNAseq coverage: 239x (Rank: top 43%)
Annotation
Heliconius: HMEL002157, E-value 0.0, identity 77.36%
Bombyx: BGIBMGA000698-TA, E-value 0.0, identity 68.94%
Drosophila: blue-PA, E-value 0.0, identity 38.97%
EBI UniRef50: UniRef50_E2C3S2, E-value 0.0, identity 44.52%, NHR domain-containing protein KIAA1787-like protein n=7 Tax=Formicidae RepID=E2C3S2_HARSA
NCBI RefSeq: XP_001601781.1, E-value 0.0, identity 43.57%, PREDICTED: similar to ENSANGP00000008696 [Nasonia vitripennis]
NCBI nr blastp: gi|307195422, E-value 0.0, identity 44.52%, NHR domain-containing protein KIAA1787-like protein [Harpegnathos saltator]
NCBI nr blastx: gi|350421669, E-value 0.0, identity 38.18%, PREDICTED: neuralized-like protein 4-like [Bombus impatiens]
Group
KEGG pathway 
InterPro domain: [223-348] IPR006573, E-value 2.9e-50, NEUZ
InterPro domain: [5-171] IPR008985, E-value 4.5e-07, Concanavalin A-like lectin/glucanase
Orthology group: MCL15327, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201376-TA
ATGAGGTTTCACAGACGATGTGGTGACAGAGTCACTTTACTCCATGACAATACAACTGCTGTAAGGAATTTTTTAGAATTTAATCATGGTCTCATTCTAAGTGCAGAACCCATTTTAGATGATGTTTTGTTTGAAGTTTGCATCGATAGAAAGGTTAATGTTTGGAATGGAAGTCTCGAAATTGGAGTTACCACCTTGGACCCCGAGTTTATGGAACTACCTGCCACTGCTACTAAACTTCGTAATACTGCATGGATTATGTCCGGAAGCTCAATCGTCAAAGATGGAATAACTTTGGTAAATTCATATGGTCCTGAACTAGACACACTTAGAGAAGGAGATACTCTTGGTGTTATGAGATCTTGTAAGGGGGAGCTGTTTTTTTATATAAATGGTAGATGCCTAGGTGTGGCTGCTTGTGATCTACCCCCAAGATTGTTTGCTGTCATTGATTTGTACGGACAGTGTGTGCAAGTATCAATAATTCACACAACTGGTATGAGAACAATAATGGAGAGTAGTATGGATCAGATTGAGGAAGATCCAAACAATGATGATACATTAACACCTACTGAAGAAGCCAGTGCTGTACCGGAATCATCAACATCCAAGGAGCAACAGGAAGTCTATATTTCAATGCCTGATTCGTCATCCATCTTAGCTTCTCATGATCGTCTACGTTTCCATCCTAGATGTGGAATTTTAGTCAAACTTTCTTCTAATAACAAGACTGCGGAGAGAGCTCGACCATTAGACGATTACAACAACGGCGTCGTTATGACGCACCGGCCGCTATACGACAACGAGCTTTTTGAGATTCGTATAGACAGACTGGTGGACAAGTGGAGTGGCAGTATAGAGGTCGGCGTGACAACCCACAACCCGGCAACCATTCGCTTCCCGTCAACTATGACGAACATCGACACCGGCACTATTATGATGTCGGGTTGCAAGGTGTTGCTCAATGGACATGGGACATGTATGGAATATGGGAACTTCAATCTTGATGAACTTCGGGAAGGTGACACAGTGGGTATGCTGAGGAAGTCCACAGGCAAGCTGCACTACTTCATTAATGGTATAGATCAAGGTGTAGCGACCGACAAAGTGGAGCAGCAAGTATGGGGCGTCGTCGACTTGTATGGCATGACAGTCAAGGTATCTATAGTGGATCCTTGTGAGGATATCGATAGTAATATCAACAACATGCCCACTGTATCGACTGTACCCGATCTTCCTGCTCCTAGTAGATTCACACCGAGGATAGATGAAGATAGCCTTTTATTTCATACGCTGCGTGATTCGTATGTTATTATTATTAACGATGGTAAGACTGCTCATAGGCCTAATGCTTTCGAATACTTCAACAACGGCGTAGTGATGACGAACAGATCTCTACGAACCAACGAGTTGTTCCAAGTGCGACTCGATCTGGTGGTGCCCAAATGGGCTGGAAGCATTGAGATAGGAGTCACGCAGCACACTTCTAATGATATTAAAATTCCATTTAAAATGAGCAATGCCAAATCAGGCACTTGGGTCATGTCCGGTGAAGACGTTATACAGGACGCGATGATTATTATACCAAAATACGTCAGAAATCTGAATAGATTAGTGGAAGGGGACACTGTAGGAGTGATGCGAAAAGATATGGGCATTCTACATTTTTTTGTGAATGGGGTCGATCAGGGGCCTGCTGCTTTTAATATACCAGAACACGTTTTTGGGATCATTGATTTGTACGGTCGTGTGGCTCAGGCTACAATAGTGGATTGCTATTCACCACCGACCACCTACTCGCCCGACTCGCCTATTTCAACGGAGTCCAATGCAACGATTTATCCCGAGATGTGTTTCCATCGCGTGCACGGTCGCAACGCTCGTCTGAGTCGCAGTCGCCTGACAGCGTCAAGGGCGGCGGTATACTCGGAGTTCAACGACGCGGTGCTGTTCAGTTCGCGGCCGCTGAGAGAGTGCGACATGTTCGAACTCAGGATTGA
CTCGATGGTGGACTGTTGGATCGGCAGCGTAACAGCAATCCGGCCGGACGATCTAGAAGCGAATGGTTTAGCTGGTACGGCGACAGATCTTAACTGGGATACGTACATACTGAGCGGTGCTGCTATGATGAAGGATGGGGAGTGTGTTCGTAGCGGGTACCCTCTAGATTTGGATACATTGACTGTAGGCAGTAGAGTTGGTATGATGTGGCATGCGGACCGCAGTCTGCACTACTACTTGGACGGTATGGACATGGGCAAGGCTTGGTACGTGCCACATCTCAATATATACGCCGTGGTAGATCTTTATGGCCAATGTACTCAGGTGACTATTCTACAAAATGAAGAAAGAGCGTTCAATTATAACGGCTGCACAAATTCTGACAATTCGATCTTGAGCAACTCAAGAGCAGTGACGAGCTTCTCTGAATACTACGGCGACAATATCTGCATGTTGAATGATTACTCCATCGCGTGGCGTCACTTGCCTGATCCCATGGCGGCGATAGTGTTCAGCTCCACACATCTTGCTATTAGCGAGATGTTTGAGATAAAAATCGTGGAGTGCAAGTACGGTTTCGCTGGCAGCTTCCGTATGGGTGTAACTGATATCAACATATTGAACGCACACGTCAACAGCAGTCTGCCGCCATCTGTCGCATGTCTACCACACTACACGGCCTATATCGATGGTAGGTATATAAGATATTCGAGGCCGGGGAGCCGTCAGCAGGATCTCAAGGTGATGGTGCCGTCATTCGAGTGGCTAAGACCGGGGGATCGTATTGGACTCAAGAAGACCACTGACAATAGAGTTCTCATTTACTACAACTGTGAACTGTTAGAGGTCGCCTTCGAGAAAGTCCCCGATAAAGTATATGTGGTGATGGAAATCTACGGATCGGTGTACAAGATACAAGCAGTGACGAGAGGGAATCCCGTAGCTGTTTTACCTCAGACGATCCCCAACGATGTATGCCTGAAACCGGGAGCAGAGAAGTTAGAGCAGACGTCGAACTCAGAATCAGAGACAAGTAGTGCGATGACCGTCGGGCCGGTTGAGAGAAGACGACGAGCCGCACTTCCATACACCTTCCACTATGTACACGGACGTAATATAAAACTATGTAGTTCAGATACAGTCGCTATGCGTGTATCGGGCTATCAGGACGCGGTCGCTATCGTCAGTCAGCCGCTGAGACGTGGTCACAGATTCAGGTTCCGTGTGGACAAGATCGACTCCAGCTGGGACGGCAGTCTGGCCATCGGTGGCGTAGCGTGTCTCCCCGACGGCGTGCCCGAGAGCGCCATGAGACTCGACCCGCCCGTCTGGCTGCTGTCCAGTGATTTACACTACGATGGAAATAATCTTATAACCCATACAACGTCTGAGCTAGAAGATATATGCGAACAGTCAGTGTTGACGCTCCACTTCCGGTTCAATAGCGAGCTTGTGGTGGACATCGACGGCGTGGTCTTAGGAGTGGTCGGCTCAGTGCCTAGGACGCATAGCCACGTCTACCCACTCATAGATTTATATGGAAAGGTTTGTCAGGTATCTGTACTTTTACACCCGTACACCGTGGTATCCGGGACTTCCCTTGATTTACCTGTGATAGCGGAAAATCGTAGAGAAGAAGAATCTTTGGCAGAATTGAATCTAGAAACTTCACAAGACGACAGCGTTCCTATTATTGAACGAGTGGATTTACAACGTCGTCGAAAAACTAAAGCATACAGCCAAGATAACCTTGTCGAAGTATTTGATCCCATTAACGTTGAGCCCGGGCCGTCTCACCCTAGACCGACTGTTCCCGACAATGAAATGAAAAATAAAGAAATTAAAAGTATTACTCCCGTTCGTACCATGCATCACAGTCTTATCCTCAATCATTCCAATGACAACAAGGATTTGTCATTTCGAACCGTACAGAGAAGTCATTCTAGTCACGATTTTTGTCGCATGTCTGATTCAATGGACGTTAGAGATGCTCTCCGGC
TTACATTCATCACGGGCTGCCAGGACGACGATCATGAATTACATGATCAGTTCTGCGACACTAATATAAGACACAATGATCGCTACCCAGAAGCGAATGATGTTGATAATCTAGATAATGAAATTATGGATGCATTAACACTGGAAACGGGCAACTTGCCAGATTTAACCGAAGAGCAATCATCATCGATAGACGAATCCACTCGTCGCGATGATTTATATTCGGAGCCATTGTCTATTGAGAATTTAAACTATTTGAATCGTATATTATGTTTGGATCAGGACGGTTTCGAGGGGCAGAGTCTGGAGAGCGAATGGGAAGCTTTTGGTGAAGGTTGTGAGCATCTACATTTAGTATTGAAGTACTGGAATTACCTTGTTCTACCTTATCCGGAATTGCGTCAGGCAGTCTCTTGGGGTCAAGTCCGATGTTATTGCAGCAACTGCCAGCCTGATGCCCAGCCCCCACTAGCTGGGTGGGTTCGTATAGAGCGTATAGACGGAGTTGGTGCGGCCACACGAGGCTGGTGGCACGTGACGCGAAGTACGCTCGGCGCGGCTCAGGCTCACCGCAACGACGTCAACCCGGCTAAGAGACCGCGAGCACTCGCTCCAGCACCGACCAGCGGATCACTTGGTGACATATGGCTGGATGAAGAAGGGAAACCTCATCATACGATATTGGCTATAGAAATTGACATCGAAGGAAATGAAGCTGAGAAAGATCGACTTTTAGCTTTCTTAATCCACATGAAGTCACATGCAGTTTTCGTCGATGATAATCCTTCTAGTATAGATGAATAA

Protein sequence:

>DPOGS201376-PA
MRFHRRCGDRVTLLHDNTTAVRNFLEFNHGLILSAEPILDDVLFEVCIDRKVNVWNGSLEIGVTTLDPEFMELPATATKLRNTAWIMSGSSIVKDGITLVNSYGPELDTLREGDTLGVMRSCKGELFFYINGRCLGVAACDLPPRLFAVIDLYGQCVQVSIIHTTGMRTIMESSMDQIEEDPNNDDTLTPTEEASAVPESSTSKEQQEVYISMPDSSSILASHDRLRFHPRCGILVKLSSNNKTAERARPLDDYNNGVVMTHRPLYDNELFEIRIDRLVDKWSGSIEVGVTTHNPATIRFPSTMTNIDTGTIMMSGCKVLLNGHGTCMEYGNFNLDELREGDTVGMLRKSTGKLHYFINGIDQGVATDKVEQQVWGVVDLYGMTVKVSIVDPCEDIDSNINNMPTVSTVPDLPAPSRFTPRIDEDSLLFHTLRDSYVIIINDGKTAHRPNAFEYFNNGVVMTNRSLRTNELFQVRLDLVVPKWAGSIEIGVTQHTSNDIKIPFKMSNAKSGTWVMSGEDVIQDAMIIIPKYVRNLNRLVEGDTVGVMRKDMGILHFFVNGVDQGPAAFNIPEHVFGIIDLYGRVAQATIVDCYSPPTTYSPDSPISTESNATIYPEMCFHRVHGRNARLSRSRLTASRAAVYSEFNDAVLFSSRPLRECDMFELRIDSMVDCWIGSVTAIRPDDLEANGLAGTATDLNWDTYILSGAAMMKDGECVRSGYPLDLDTLTVGSRVGMMWHADRSLHYYLDGMDMGKAWYVPHLNIYAVVDLYGQCTQVTILQNEERAFNYNGCTNSDNSILSNSRAVTSFSEYYGDNICMLNDYSIAWRHLPDPMAAIVFSSTHLAISEMFEIKIVECKYGFAGSFRMGVTDINILNAHVNSSLPPSVACLPHYTAYIDGRYIRYSRPGSRQQDLKVMVPSFEWLRPGDRIGLKKTTDNRVLIYYNCELLEVAFEKVPDKVYVVMEIYGSVYKIQAVTRGNPVAVLPQTIPNDVCLKPGAEKLEQTSNSESETSSAMTVGPVERRRRAALPYTFHYVHGRNIKLCSSDTVAMRVSGYQDAVAIVSQPLRRGHRFRFRVDKIDSSWDGSLAIGGVACLPDGVPESAMRLDPPVWLLSSDLHYDGNNLITHTTSELEDICEQSVLTLHFRFNSELVVDIDGVVLGVVGSVPRTHSHVYPLIDLYGKVCQVSVLLHPYTVVSGTSLDLPVIAENRREEESLAELNLETSQDDSVPIIERVDLQRRRKTKAYSQDNLVEVFDPINVEPGPSHPRPTVPDNEMKNKEIKSITPVRTMHHSLILNHSNDNKDLSFRTVQRSHSSHDFCRMSDSMDVRDALRLTFITGCQDDDHELHDQFCDTNIRHNDRYPEANDVDNLDNEIMDALTLETGNLPDLTEEQSSSIDESTRRDDLYSEPLSIENLNYLNRILCLDQDGFEGQSLESEWEAFGEGCEHLHLVLKYWNYLVLPYPELRQAVSWGQVRCYCSNCQPDAQPPLAGWVRIERIDGVGAATRGWWHVTRSTLGAAQAHRNDVNPAKRPRALAPAPTSGSLGDIWLDEEGKPHHTILAIEIDIEGNEAEKDRLLAFLIHMKSHAVFVDDNPSSIDE-
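
Length consistency check:

The transcript and protein lengths listed for this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes that the transcript is a coding sequence with no UTRs, which is consistent with the nucleotide sequence starting with ATG and ending with the stop codon TAA, and that the trailing "-" in the protein sequence marks that stop codon. The function name expected_protein_length is hypothetical.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the sequence is coding only (no UTRs). If includes_stop is
    True, the final codon is a stop codon and encodes no amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths as listed in the record for DPOGS201376-TA and DPOGS201376-PA.
transcript_bp = 4803
protein_aa = 1600

# 4803 / 3 = 1601 codons; removing the stop codon leaves 1600 residues.
lengths_agree = expected_protein_length(transcript_bp) == protein_aa
print(lengths_agree)
```

Under these assumptions the listed 4803 bp and 1600 aa agree: 1601 codons, of which the last is the stop.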