Monarch geneset OGS2.0

DPOGS206091
Transcript: DPOGS206091-TA (6378 bp)
Protein: DPOGS206091-PA (2125 aa)
Genomic position: DPSCF300028 + 92662-106490
RNAseq coverage: 293x (Rank: top 38%)
Annotation
Heliconius: HMEL012110, E-value 0.0, 81.02% identity
Bombyx: BGIBMGA006817-TA, E-value 0.0, 81.09% identity
Drosophila: sdk-PC, E-value 0.0, 61.95% identity
EBI UniRef50: UniRef50_E0V9W7, E-value 0.0, 63.08% identity, Protein sidekick, putative n=4 Tax=Neoptera RepID=E0V9W7_PEDHC
NCBI RefSeq: XP_002422911.1, E-value 0.0, 63.08% identity, protein sidekick precursor, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242003928, E-value 0.0, 63.08% identity, protein sidekick precursor, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|242003928, E-value 0.0, 63.08% identity, protein sidekick precursor, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515, E-value 3e-17, protein binding
KEGG pathway:
InterPro domain:
  [1590-1697] IPR013783, E-value 9.5e-26, Immunoglobulin-like fold
  [673-793] IPR008957, E-value 4.5e-25, Fibronectin type III domain
  [888-969] IPR003961, E-value 3e-17, Fibronectin, type III
  [302-384] IPR013098, E-value 6.1e-14, Immunoglobulin I-set
  [407-472] IPR003598, E-value 6.2e-14, Immunoglobulin subtype 2
  [492-575] IPR003599, E-value 5.1e-13, Immunoglobulin subtype
Orthology group: MCL11447, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206091-TA
ATGCAACCCTCGTCTTCAAACAGTATAGTAAGAGAAGGCACTACAAAAATTCTTCAGTGTTCAGCTATTGGCATTCCACAGCCAATGTACAGATGGCTTAAAAATGGTGTTCCGATTGGAGATTATTCATCGGAGTTATTTTTTAAAATACACAACACTCAACGCCAGGACGCTGGAGCTTACCAGTGTATCGCTAAGAACGACGTCGGAGCGATTTTCAGTGAAAAGAACAACATTGTTGTCGCTTATATGGGTGTTTTTGAGAACACTTTGGAACAAGTGGTAACAGTTGAGTCAGGTAAGGCTGCGATTCTCGACTTCCCACACATCGAATCAGATCCACCTCCATCAGTTATATGGCAGGACGAGAACGGCGTGAACGGCGTGCTGAGATACGATCAGAAATATGCTATCACTGATAAACATCAACTCGTCATTCTATGCTCTTCAAAGGACGACCAAAGAGCTTATAGAGCAAGAGCGATTAACACACAATTAGGTAAGGTAGAAAATAGTCCGTACATTAGGCTGATAGTCGATGGAGACGACAATAAGGAAATCGCCCCAGAAATAATAATTAAACCGCAAGACACTAAGATAATTAAGGGTCAAGAATATACAAACGTTTATTGTATAGCGAATGCGCGGCCGTTGCACGAATTAGAAACATTATGGTTCAAAGACGGCGTATTGATCGATTTGGCGGGCATTACATACGACCTGAATGATCAATGGAACCGAACGTTGAGTCTCATATCGGCCAATTTGAACCACACTGGACAGTACACGTGTCAAGCTAGGTTAAAGTCCGGCGGATTCGCGACAGTAACCGCATCAGCTACGGTCACAGTCTTCGAAAAACCGGTTATGCCGACCAGCTTGAAACCGGAGACTTTCGGAGAGTTCGGGAGCTCCGTTGTGTTAGAGTGCAACGTTCAAGGCATTCCAATCCCGAGTATAACCTGGTACAAAGACGCTAGGAAAATTGCTAGCGTCGGCGCCGATGCTGCGAGCGATAACTCCGATGTAGACGACGGCGGAGGCAGATACAGGGTGGATGTAGATCGATCGCTGGTCATCAGTCACCTGAAGATGGAAGACATGGGCATATATCAATGTATAGCTAACAATGCAGCCGGCGAGTCATCAATCTACTCCTGGTTGAAAATAAAGACGTCCCCGCCAATAATGCAGACAGGCCCCGCAAACCTCACAGTACTGGACGGTAAAGATGCCACCATTAGCTGCAGGGCCATCGGCGCCCCCACCCCCAACGTAACCTGGTACTTCAATGATTCTCTCATAATAAATCTGTCTGGAAGACTGCAAGCTTTAGACGAGGGAGACTTGCTAATTACGAGCGTTGCCACAGCGGATAGCGGGAAATATACTTGCATACGTGCGAATGACGCCGGCAACGTTTCAGGCGAGGCGTATCTTACAGTGCTCGTGAGAACACAAATTATCGCTCCGCCTGTGGACACGCGCGTTCTTTTAGGGCACACTGCCACGCTGCAATGCAAGGTGTCCAGCGATCCGAACGTGAAGTACAACATCGACTGGTTCCACAACAAACAACCTATGACAGCCGGTTCCCGTGTGTGGGTGTCCGTGGACGGGTCGCTGCAGGTGCAGGCGGTGCGGGCGGGGGACGCGGGGGAGTACACGTGCGTGGTGACGTCACCGGGTGGCAACCACACTCGGCGCGCGACCCTCTCCGTCATCGAGCTTCCCTTCTCACCGTCCAACGTCCGGGCGGACCGGCTCGCGAGTCCCCAGCGAGCTGTCAACGTGTCATGGACGCCCGGCTTCGACGGAAACTCACCCATACAGAAGTTCATCGTGCAGAGACGCGTCGTACCCGAATTTGGTCCCATCCCCGATCCTCTCCTGAACTGGGTGACGGAACCAATGAATGTATCGGCGAATCAACGCTGGGTGTTATTGACCAGTCTGAAGGCGGCCACTTCCTACCAGTTCAGAGTGTCGGCGGTAAACAC
AGTGGGCGAGGGTCCACCTTCCGATCCCACGGATGTGCTAACACTACCCCAAGAAGCTCCTTCTGGACCGCCGTTAGGATTCATGGGTTCAGCTAGATCTTCATCCGAAATTATAACACAATGGCAGCCACCGCTAGAAGAACATAGGAACGGTCACATACTGGGCTACGTGATAAGATATCGACTGAAAGGCTACGAGAACAGTCCGTGGACCTACCAAAATATAACAAATGAAGCCCAAAGAAACTATCTCATACAAGATCTGATAACGTGGAAAGATTACAACGTACAGATAGCGGCGTACAACGACAAAGGTGTCGGCATGTTCTCCGACAGTTACACCATCAAGACCAAAGAGGGCGTGCCTGAGGCGCCGCCGGACAGCGTCCGATGTGACCCGTACAACTCAACCGCCATACAAGTATGGTGGACGCCGCCCAACCCGCAAAAGATTAACGGAATCAATCAAAACGACAGCGTCGAACAGAAGCTGGTTAGCGTACCGCCCAATCTTTTGGATCCCCTCACGGAGCAGACGGCTGTCATTAATGGATTAGAAAAATTCACTGAATACAATATATCAGTTCTGTGTTTCACGGAACCAGGCGATGGTCCTCGCAGTGAGTTTATCAGCGTCAGGACGAAGGAAGATATTCCGGATGAAGTAATGAATCTTCAATTCGACGACATATCAGACAGAGCAGTGAGAGTGTCCTGGTCCCCGCCGAAGAAGTCCAACGGCGTCCTAATCGGATATAAACTTAAATATCAGATTAAAGAGAACCCTGAAACTTTTAAGGAAGAAATATTACCGCCCAACGTGACTAGCGTCAGAGTGGAACATCTGCAGGCTAGTACACAGTACCAGTTCTGGGTGAGCGCTCTGACCGGGGTGGGCGAGGGCGGGGCGCGGGCGGCGGCGCTGCAGTCCGGTGTGGAGCCCGTGCTGCCGCGAGCACCCACCAACCTCGCCCTCTCCAACATAGAGGCGCACTCCGTTCTGCTGCAGTTCACGCCCGCCTTCGATGGAAACTCGTCAATATCACTGTGGACCGTTCAGGCACAAACCGCCCGCAACTCGTCCTGGGTGACAATATATGAAGTTAACGCTCCCGACGCTCAGTCGATCCTGGTAACCGGGCTGATACCTTTCACCACCTACAGACTGCGACTCATCGCTACTAACATAGTCGGCTCCTCCCCACCTTCAGAACCCTGCAAAGAGTTCCAAACCATACAAGCACCCCCACAACATCCGCCGAGAAACGTTACCGTTAGAGCTGTCAGCGCTAATAATTTACGCGTTAGATGGATTCCTCTCCAACAAAGTGAATGGTATGGCAACCCCAAAGGTTATAACATCACATACAAACGCAGCGGCAGTAATGATACTCTATATAGTATTATAGACGATCACACGGCCAACTCACACGTATTATCAAATCTCGAGGAGTGGTCCGTGTACGAAATTACTATGACGGCTATTAATGAAGTTGGCACTTCTGCTGTCAGTCCTACGGCCACTGAAAGAACTAGAGAAGCAGTGCCATCTAGCGGTCCAATAAACGTATCTGCGAACGCGACGTCATCCACCACCGTAGTGGTGCTATGGGGTGATATACCCTTACAAGATCAAAACGGTCTCATCGAGGGATACAAGGTGTGCTACGCCGCGGTCGTGCCACCGCCGCGACCAGAACACAAGAAAGTTGAATGCCATCCGATACCATCCAATCAGACACACACTGTGACGCTGACGGAGCTAAGGAAATACGTGGTGTATCAAGTGCAAGTGTTGGGGTACACGAGGCTAGGAGATGGCGCGCTCAGCGACCCTCCAGTTACTGTCAGAACCTATGAAGATACTCCCGGTCCACCATCGAACGTATCGTTCCCTGACGTGACGTTCACGACCGCTCGCATCATCTGGGACGTACCTGAGGATCCCAACGGGGAGATCTTAGCCTACAAAGTCACATATCATCTGAACGGCTCAACGC
TTCACATGTTCTCCAAAGAGTTTTTACCATCCGACAGAACTTTTAGAGCGACCGAGCTGGCCTCAGAGCAGTACTACGTGTTCTCGGTGAGAGCTCAGACCCGTGTGGGGTGGGGAGGCACGCTCCGGGCGTTGGTGTTGACGACGGCCAATCGCGCCGCGCCCGCGCCGCCCGTCACGCCCAACGTGGCGCGATCACTGCTGCAGCCGCATCACATCACCTTCTCGTGGACGCCCGGCGACGATGGATACGCTCCACTAAGATATTACACGATACAACAGAAAGAGGAGGGCAGCACGTGGCAGACTCTCCCTGAACGCGTGGATCCGTTCGCTACCTCGTACACTGTGGATGGACTGAAACCGTACACCGCCTACCAGTTCAGGATACGAGCCACCAACGACATCGGACCCAGTAGATATAGTAATGCTACAGAGACCGTTAGGACCTTGCCGGCTGCCCCAAGCAAAGCAGTTGAAAAGTTGGTAGTAGTTCCCATAACTCCGAGCAGCGTCCGAGTACAGTGGCGGGCGCTCGGCGAACAACACTGGAGCGGGGACACGCGCACCGGGGGCTACTCAGTGTCGTATCAGCCGCTTACCGACTTCACCTCTCTTCAACGCGCTATGAAGAGAGAAGTCCCCGGGATCAAGTCAGAAGAGGTGATTCTCACGGATCTAGCGGTGGATCGTAACTACGAGATCAGCGTTTGTGCGGTGAACTCTCAGGGCGCGGGACCGGTGGGCGCTCCGGCCGTGGTGTGGGTGGGGGAGGCCGTACCTACGGCGCCTCCCCAACAAGTGGAGGCTAGGGCTCTCTCCCCCACAGAAGTAGCGCTCACTTGGCATCCACCGCTGCAGGCCCAACAGAACGGAGACCTGCTCGGTTATAAGATCTTCTACCTAATGACGGAATCTCCCGAGGAACCTGAACCAGGACGTCGTGCTGAGGAAGAAATTGAAGTAGTTCCTGCCACAGCTACGTCACACTCGCTGGTGTTCCTCGACAAGTTCACACAATACCGAATACAGGTTCTAGCGTTTAATCCAGCGGGCGACGGTCCTCGGTCGACTGCCATACAAGTGCGGACACACCAGGGCCTCCCTTCAGCTCCTCGTAACATAACATTCACAGACATCACAATGAACAGCCTCGTAGTGTCCTGGGAGCCGCCGCAGAGACGAAACGGACTCATACACTCGTACCTCGTCACTTACGAGACCATAGAGCAGGATGAGCGTTTTAGCAAGCAGGTGAAGCAGAAGGTTAACGAGCGACGGCTGGCGGTGGGGACCCTGGAAGAGGAGGTGGAATATCGGTTCAGTGTGAAGGCGGTGACTGTTGGATCCGGGGCTGCGGCGGAGGCGCGTGTGCGGACTGGACCGCAGCCCGGCTCGCCCTCACCACCCGCGGCCCTGCGACTGAGGGCCGATGTAGCCGCTCTTACTATGAAGTGGACCAATGGAGCCTCAGGGCGAGGGCCGCTGCTGGGATACTACTTCGAAGCCAGAAAGAAAGACGACACCAGATGGGAGACTATAACGAGAACGAGTAACGGTATTCTTGAAGAGTTCACGATCTCATATCAGAGTCTGCTTCCTTCCACCGCGTACTCGTTCAGGGTGATCGCATACAACATGTACGGGATCAGCAACCCGGCGTACAGCGACAAGGTCATCGTGACACCGTCCAAGTTGTATCTAGAGTACGGGTACTTACAGTACCGGCCGTTCTATAGACGCACCTGGTTCATGGTCGCCTTAGCCGCGGCCTCCATTATCATCATCATCATGGTGATAGCGATACTATGCGTCAAAAGCAAAAGTTACAAATATAAAAAGGAAGCACAGAAGACGTTAGAGGAATCACTCGCCGGGGAGACGGACGAGCGCGGCTCGCTGGCGTTGGACATGTACCGATCGCGGCAGAATTCCGTAGCGAGTGTGGGAGCACTGGGAGGGACGCTGCGCCGCAAGCCGGTCCACGGACCAGCGCTGGGC
AAGTCTCCGCCGCGGCCATCCCCCGCCTCCGTCAACTACCGCAGCGACGAGGAAAGCCTCCGCGCCTTCGACGACCACCCCGACGACTCGTCCCTCACCGAGAAGCCCTCCGAGATGAGCTCCTCGGACTCCCAGAACTCTGAGAGCGACAACGAGAGCGTTCGATCCGAACCTCATTCGTTCGTGAATCACTACGCTAACGTGAACGACACGCTGCGTCAGTCCTGGAAGAGACAGCGGCCGGTGAGGAACTACTCCAGCTACACGGACTCGGAGCCCGAGGGCAGCGCGGTCGTCAGTCTGAACGGCGGCCAGATAGTTATGAACAACATGGCGCGCTCCAGGGCTCCGCTGCCGGGATTCTCGTCGTTCGTATGA

Protein sequence:

>DPOGS206091-PA
MQPSSSNSIVREGTTKILQCSAIGIPQPMYRWLKNGVPIGDYSSELFFKIHNTQRQDAGAYQCIAKNDVGAIFSEKNNIVVAYMGVFENTLEQVVTVESGKAAILDFPHIESDPPPSVIWQDENGVNGVLRYDQKYAITDKHQLVILCSSKDDQRAYRARAINTQLGKVENSPYIRLIVDGDDNKEIAPEIIIKPQDTKIIKGQEYTNVYCIANARPLHELETLWFKDGVLIDLAGITYDLNDQWNRTLSLISANLNHTGQYTCQARLKSGGFATVTASATVTVFEKPVMPTSLKPETFGEFGSSVVLECNVQGIPIPSITWYKDARKIASVGADAASDNSDVDDGGGRYRVDVDRSLVISHLKMEDMGIYQCIANNAAGESSIYSWLKIKTSPPIMQTGPANLTVLDGKDATISCRAIGAPTPNVTWYFNDSLIINLSGRLQALDEGDLLITSVATADSGKYTCIRANDAGNVSGEAYLTVLVRTQIIAPPVDTRVLLGHTATLQCKVSSDPNVKYNIDWFHNKQPMTAGSRVWVSVDGSLQVQAVRAGDAGEYTCVVTSPGGNHTRRATLSVIELPFSPSNVRADRLASPQRAVNVSWTPGFDGNSPIQKFIVQRRVVPEFGPIPDPLLNWVTEPMNVSANQRWVLLTSLKAATSYQFRVSAVNTVGEGPPSDPTDVLTLPQEAPSGPPLGFMGSARSSSEIITQWQPPLEEHRNGHILGYVIRYRLKGYENSPWTYQNITNEAQRNYLIQDLITWKDYNVQIAAYNDKGVGMFSDSYTIKTKEGVPEAPPDSVRCDPYNSTAIQVWWTPPNPQKINGINQNDSVEQKLVSVPPNLLDPLTEQTAVINGLEKFTEYNISVLCFTEPGDGPRSEFISVRTKEDIPDEVMNLQFDDISDRAVRVSWSPPKKSNGVLIGYKLKYQIKENPETFKEEILPPNVTSVRVEHLQASTQYQFWVSALTGVGEGGARAAALQSGVEPVLPRAPTNLALSNIEAHSVLLQFTPAFDGNSSISLWTVQAQTARNSSWVTIYEVNAPDAQSILVTGLIPFTTYRLRLIATNIVGSSPPSEPCKEFQTIQAPPQHPPRNVTVRAVSANNLRVRWIPLQQSEWYGNPKGYNITYKRSGSNDTLYSIIDDHTANSHVLSNLEEWSVYEITMTAINEVGTSAVSPTATERTREAVPSSGPINVSANATSSTTVVVLWGDIPLQDQNGLIEGYKVCYAAVVPPPRPEHKKVECHPIPSNQTHTVTLTELRKYVVYQVQVLGYTRLGDGALSDPPVTVRTYEDTPGPPSNVSFPDVTFTTARIIWDVPEDPNGEILAYKVTYHLNGSTLHMFSKEFLPSDRTFRATELASEQYYVFSVRAQTRVGWGGTLRALVLTTANRAAPAPPVTPNVARSLLQPHHITFSWTPGDDGYAPLRYYTIQQKEEGSTWQTLPERVDPFATSYTVDGLKPYTAYQFRIRATNDIGPSRYSNATETVRTLPAAPSKAVEKLVVVPITPSSVRVQWRALGEQHWSGDTRTGGYSVSYQPLTDFTSLQRAMKREVPGIKSEEVILTDLAVDRNYEISVCAVNSQGAGPVGAPAVVWVGEAVPTAPPQQVEARALSPTEVALTWHPPLQAQQNGDLLGYKIFYLMTESPEEPEPGRRAEEEIEVVPATATSHSLVFLDKFTQYRIQVLAFNPAGDGPRSTAIQVRTHQGLPSAPRNITFTDITMNSLVVSWEPPQRRNGLIHSYLVTYETIEQDERFSKQVKQKVNERRLAVGTLEEEVEYRFSVKAVTVGSGAAAEARVRTGPQPGSPSPPAALRLRADVAALTMKWTNGASGRGPLLGYYFEARKKDDTRWETITRTSNGILEEFTISYQSLLPSTAYSFRVIAYNMYGISNPAYSDKVIVTPSKLYLEYGYLQYRPFYRRTWFMVALAAASIIIIIMVIAILCVKSKSYKYKKEAQKTLEESLAGETDERGSLALDMYRSRQNSVASVGALGGTLRRKPVHGPALG
KSPPRPSPASVNYRSDEESLRAFDDHPDDSSLTEKPSEMSSSDSQNSESDNESVRSEPHSFVNHYANVNDTLRQSWKRQRPVRNYSSYTDSEPEGSAVVSLNGGQIVMNNMARSRAPLPGFSSFV-
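
Consistency check (illustrative sketch, not part of the original record):

The lengths stated in this record can be cross-checked with a little arithmetic. The sketch below assumes that the transcript sequence is coding sequence only, ending in a single stop codon (the nucleotide sequence starts with ATG and ends with TGA, and the protein ends with a "-" stop marker). It also assumes that the InterPro spans are 1-based, inclusive residue positions. The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the record; the helper names below are illustrative only.
TRANSCRIPT_BP = 6378
PROTEIN_AA = 2125

# InterPro spans as (start, end, accession).
# Assumption: coordinates are 1-based, inclusive residue positions.
DOMAINS = [
    (1590, 1697, "IPR013783"),
    (673, 793, "IPR008957"),
    (888, 969, "IPR003961"),
    (302, 384, "IPR013098"),
    (407, 472, "IPR003598"),
    (492, 575, "IPR003599"),
]


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def domains_in_order(domains):
    """Return the domain spans sorted from N-terminus to C-terminus."""
    return sorted(domains)


def all_within(domains, protein_aa: int) -> bool:
    """True if every span lies inside the protein (1..protein_aa)."""
    return all(1 <= start <= end <= protein_aa for start, end, _ in domains)


if __name__ == "__main__":
    print("CDS length matches:", expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    print("Domains within protein:", all_within(DOMAINS, PROTEIN_AA))
    for start, end, accession in domains_in_order(DOMAINS):
        print(f"{start}-{end}\t{accession}\t{end - start + 1} aa")
```

Under these assumptions, 3 x (2125 + 1) = 6378, which agrees with the stated transcript length, and all six InterPro spans fall within the 2125-residue protein.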