Monarch geneset OGS2.0

DPOGS202291
TranscriptDPOGS202291-TA4521 bp
ProteinDPOGS202291-PA1506 aa
Genomic positionDPSCF300032 + 66250-93603
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0130300.082.03% 
BombyxBGIBMGA005747-TA0.037.33% 
DrosophilaDscam3-PB0.036.20% 
EBI UniRef50UniRef50_D7GY920.037.75%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7GY92_TRICA
NCBI RefSeqXP_968319.20.039.10%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastpgi|1892421220.039.10%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastxgi|1892421220.038.98%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
Group
Gene OntologyGO:00055152.9e-13protein binding
KEGG pathway 
InterPro domain[549-681] IPR0089571.8e-23Fibronectin type III domain
[354-462] IPR0137834.7e-23Immunoglobulin-like fold
[467-559] IPR0130983.2e-15Immunoglobulin I-set
[566-659] IPR0039612.9e-13Fibronectin, type III
[99-178] IPR0035981.1e-11Immunoglobulin subtype 2
[379-465] IPR0035996.7e-11Immunoglobulin subtype
Orthology groupMCL10022 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202291-TA
ATGCGTATTGAATGGGACTTTGAAGAAACGAATGAAATTCAAGAAAACGACCCTGACAGTTTACTACTACACAAACTTAATCAATCCAATCAGAAGTATAGCTGTTCTGTGGTGGATGGCTCAAAATACACTTGCGATGATGTGAAGAGATTGATGAAACGAGATGAAGAAAGTATTGTTATATGGTGTTTGTTGGTCCGAGGAGAGGATTCTGTTTACTTGAAACGTGAACGATCGAAAAGGTCGCCTGATTCCCGTCCGGAATTGGTTTACACTTTCATCGAACAGGCTGTACAGCCGGGGGCGCAAGTAGCTCTCAAATGTTCTGCAGTTGGAGAACCACCTCCGCGTTTCAGGTGGATCCTGGATGGGCTACCCATACCAGCACATCATGGGGCCATGGTAACGGAAGGTCGTGAAACGGGACAGTCAGGTTTGGGGTCCAGCAATTATATACTGTCTACCCTTTCACTGAACAGTGCACGTGTGGAACACGGCGGGAGATATGAGTGTAGAGCTACCAACACCCACGGATCAGTCGCTCATGCTGCGAGACTTAATGTTTACGGTCCACCATATATACGGTCAATCAATCCCGTGAAAGCTGTCGCTGGTTCAGATATCACCATTTGGTGCCCATATTACGGCTTTCCAATTGACTCGGTGAAATGGGAGGGCGGTGCTGGCAATGATCCCAGATACCAACAATCCGATGGACAACTAACTATCAGCAACGTCGATCGTACTAGGGATAAAGGTGGCTGGATCTGTTCTGTTCTAACTCCTGGCGGAGAGCTTGCTAGAAGGGAGGTTCAAATAAACGTTGTATCCCCGCCAGTACTCTCCCCTATAGTATTTCCTCCTGGACTAAGATCAGGCGATCGAGCACAACTTACTTGCACAGTGACGTCAGGAGACATGCCCGTGTATTTTTCATGGCTCAAAGACCAAATGCCCATCTCTAGCGTACTTCAGGTAGACGAACGTGGAGCTGAGTTCTATAGTATGCTCCTTTTCAAAAGCCTGACCGCGGCTCACAGCGGAATATACACCTGCGTCGTTACTAATACCGCTGGGAAAGCAAACATGTCCGCTGAACTAGCTATCAAAGTACCCCCGTATTGGCAACTGGAACCTTCTGATACATCAGTTTTACTGAACGGCTCCCTCACGGTCAGCTGTGAAGCTAGAGGACATCCTCCACCTCACATTTACTGGACCAAATTCACTGGCAGTACAGAAAGTGCATTGGGCGGAGTGTCGGATCCCGTAGTACTTGGCAACGGTTCTCTACGACTGGACTCAGCACGTGCAGAACATTCTGGGAAGTACCGTTGCAAGGCTGAAAATGGCGTGGCACCCGCTTTATCTAAAACCCTAACGATACACGTTAATGCTAGATTCGAAACACCGTCAGTGAATGTGACGGCTAAGCTTGGTGAGACTGTGAGACTGGCGTGTGTGGCGAGAGGAGATAATCCACTCTCCATGGCCTGGAGTCATTCTGGGAGGACCTTACCAAATTCTGATTACAGGATGAGTATATCAGAAACCAGAAGCGTTGAGGGTCTACGTAGTGAATTGGTCATCGAACGTGCTGATAGACGGGATTCTGGAGTATACCGTTGCCAGGCTACCAATCCATATGGACGATCTGACCATTTCGTGCATTTAGGAGTTCAAGAGCCACCGGAACCTCCGGCCAATTTCCGGGTGGTGGACGCAACATCTCGTACCATCCGACTACAATGGCGTCGTCCGTTTGACGGGAATTCCCCTGTTCTGGGGTACGTCGTTCAGTACAGGAGACACGACTCCACCAATCCCGATATGAACCCTTGGAGAGATGCTGACACTCACAATGTATCGGTATCCGCTCGCAATGCTGACTCCTATTCCGAAAACGAAGAGGCCACAATTACGGGACTCGAACCAGCGACAGCATACTTGGTTCGCGCGCGTACAGTGACAGCACAATTTTCGTCAGCACAAACGCGAGCCCTTCTGGCTCTAACGCTGCACGAGCCACCAGCTCGTCCACCACTTGGGCTAAGGGCATCTGCTCCACGACCGAGTACTATTGAACTAGCTTGGCAGGCTCCACCCGCATCGTCGTGGAACGGTGAACTTCTTGGGTACACGGTATGGTGGTGGCTATCTGCGGATGGCACCTTATCTGGGGGTGCCAGTGTTGGTATGGAGTTTGCCACTGTTAGAGGGCACGTCACTAAGTATACTATAGCTGGTCTGGAACATTATACACGGTATTCAGTAAGCGTAAGGGCTTTTAATAGCGCGGGTGCAGGTCCCGCGACGCCGCCTGTCAACACTCTGACACAAGAAAGTGTGCCATCTGAAGGCCCTCGTGCAGTGCGATGTAGAGCTGTATCACCTCAAAGCCTCAAGGTAGAGTGGTCTCCTCCACCAGCCCACGCTCATCATGGTGCTCTCCTGGGATACAAGTTGCTGTACCGACCGGAGCACACGACGGGTTGGGTGGAGTGGGAGGCGCGTACCGGGGCTGGTGGAACTCCAGAAGCAGGAGCAGAAGTAAAGAGAGTGGCGGGAGTTGAGACGCTTCTGTTAGCATTGAGACCGCACATGAATTATACTTTGCAAGTGTTAGCGTATACTGCTTCAGGGGATGGTGTACCCGCACATCCTGTGCATTGTACCACACAACAAGATGCACCAGGTCCACCAGCAGCCGTTAAAGTAATTGCTACATCAGTAACGTCCTTGGCAGTTAGTTGGTTACCGCCAAGTCGTCCTAATGGTCCAATTCTTTATTACACTGTTTTTTATCGCGAACTTGGCAGGGATAGACCTCCACAAACTACTACGGTCCAGGCAGAAGAAGGTGAAATGTCGGTCGGTTTAGGAGGTTATGCTGAGATACGTTCGCTATCTGAAACAGCAACTTACGAAGCTTGGGTCGCGGCACATTCTGCTGCTGGAGAGGGAGAACAGTCAGCTCCTCAACCTGCCACAACGACGTCTAGAGCAGCAGTCAGGTTACTATCCTTCGGAGTGTGGGCTCGAGTACAATGTGGACATCCACTCCGACTAGCTTGTGCATGGCGGGGGGAACCGGCACCACGAGCTCGTTGGCTTAGAGGAGACCGACCAGTCACTCATGACCCTAGAACACACCTAACCCCTCATGGGCATTTGGCCATACATGAAGTTGACTCTTCGACTATTGGTAACTACACTTGCTCGGCTCGTAACGCGTTCAGTTCTGAGGAAGAGACATACCGAGTTGAATGCGCATCACCACCATCTGCACCTACCCTCACATTAGAGAAGGTCGATCATACACTTGCCAAGCTGTCATGGCGGACTACTCATCATCCGTCGGCACCACCGCACGGCTTCACTATATATTGGATGCGTATACGAGATGAGTCTGGTGAAGGCTCTGAAGCCACCAGCTCTCGAGGTGAGGATGAACGTAGAATTGAAGCCGGAGGTGAAGCATCAAGCGTTGTTTTGAGATCTTTATCATGTGGTGGAATGTATTCTGTGAAAGCAGTGGCACATTCTCGGGCAGGTTCCTCGTCACCCTCTCCACCACTTTTAGTTCGAACGAAACCACCAGTGCTAGTATGGGAAGGTGGGGGGGATGTCAATCAGGAGGAAGGCGCAGAAGCTGCTGTATGGAGCAATAGTTCTGCTTTGGCATTAGACGCACGTAGGGCTGTCAGTTGCGGAGCCAGACTCGTGAGACTTCAATGGCGCAGGGCTGATACTAGTAGAGCATGGAGAGATGCTGATTTAACATCATTACATCATCACAGAGAAGTAGTCATCGGTGGCCTAACTGTTGGTATGTGGTATGCATTGAAGCTATGGACAGCCACAGATTCTGGAAGACAGCAAACTGTGATTTATGCTGCAACTACAACTCATTCAGGAGAACGACTTCGCCGGCCAGCCTCGTTTCTCTCACCCGGTCCGCCGGTCACACCGCACACAACGGCCGACAACTCAGAGCGCGTGCTCGGCGCGGTGTCACTCGCATTTGCTGCGCTTGCTGCACTTGCAGTTTCCGCGTTACTGATAGTTTTGATGGCGAAAAGAAGCACGCTTCTATCGTGCGGGGGCCAAGTTGATGAGGAAGCTCGTCGCTGCAGCCACACCAGTGGAGATAAGACCGCATGTCCTGAACAGCAGAACATGAGGAACTGCCAACACGACTACAAACATGACAAACTCTCACCTGCCTCCGACGTATACGAGATCAGCCCGTACGCCACATTTGCCGTGGGTGGTGACACAGCCGCTACTCTGGATCACACTCTCCAGTTTCGCACCTTCGGCCACAGGGAAAACGATGCTCCTCCTCACAGGCCTTGCAGGAAACATCCACAAAGACATCGAGAGAGAACTGATGGAGGAGGCGGTAGTACAGGCGGTGGGGGAAGTGTCGGAGGCTTCTCCGAGGTATACGCTGGAGATTCCAGCGTGGAGTCAGGCCCTGGATCTCTCTCGCCACGGACCCATCATGAGCACAGCTAG

Protein sequence:

>DPOGS202291-PA
MRIEWDFEETNEIQENDPDSLLLHKLNQSNQKYSCSVVDGSKYTCDDVKRLMKRDEESIVIWCLLVRGEDSVYLKRERSKRSPDSRPELVYTFIEQAVQPGAQVALKCSAVGEPPPRFRWILDGLPIPAHHGAMVTEGRETGQSGLGSSNYILSTLSLNSARVEHGGRYECRATNTHGSVAHAARLNVYGPPYIRSINPVKAVAGSDITIWCPYYGFPIDSVKWEGGAGNDPRYQQSDGQLTISNVDRTRDKGGWICSVLTPGGELARREVQINVVSPPVLSPIVFPPGLRSGDRAQLTCTVTSGDMPVYFSWLKDQMPISSVLQVDERGAEFYSMLLFKSLTAAHSGIYTCVVTNTAGKANMSAELAIKVPPYWQLEPSDTSVLLNGSLTVSCEARGHPPPHIYWTKFTGSTESALGGVSDPVVLGNGSLRLDSARAEHSGKYRCKAENGVAPALSKTLTIHVNARFETPSVNVTAKLGETVRLACVARGDNPLSMAWSHSGRTLPNSDYRMSISETRSVEGLRSELVIERADRRDSGVYRCQATNPYGRSDHFVHLGVQEPPEPPANFRVVDATSRTIRLQWRRPFDGNSPVLGYVVQYRRHDSTNPDMNPWRDADTHNVSVSARNADSYSENEEATITGLEPATAYLVRARTVTAQFSSAQTRALLALTLHEPPARPPLGLRASAPRPSTIELAWQAPPASSWNGELLGYTVWWWLSADGTLSGGASVGMEFATVRGHVTKYTIAGLEHYTRYSVSVRAFNSAGAGPATPPVNTLTQESVPSEGPRAVRCRAVSPQSLKVEWSPPPAHAHHGALLGYKLLYRPEHTTGWVEWEARTGAGGTPEAGAEVKRVAGVETLLLALRPHMNYTLQVLAYTASGDGVPAHPVHCTTQQDAPGPPAAVKVIATSVTSLAVSWLPPSRPNGPILYYTVFYRELGRDRPPQTTTVQAEEGEMSVGLGGYAEIRSLSETATYEAWVAAHSAAGEGEQSAPQPATTTSRAAVRLLSFGVWARVQCGHPLRLACAWRGEPAPRARWLRGDRPVTHDPRTHLTPHGHLAIHEVDSSTIGNYTCSARNAFSSEEETYRVECASPPSAPTLTLEKVDHTLAKLSWRTTHHPSAPPHGFTIYWMRIRDESGEGSEATSSRGEDERRIEAGGEASSVVLRSLSCGGMYSVKAVAHSRAGSSSPSPPLLVRTKPPVLVWEGGGDVNQEEGAEAAVWSNSSALALDARRAVSCGARLVRLQWRRADTSRAWRDADLTSLHHHREVVIGGLTVGMWYALKLWTATDSGRQQTVIYAATTTHSGERLRRPASFLSPGPPVTPHTTADNSERVLGAVSLAFAALAALAVSALLIVLMAKRSTLLSCGGQVDEEARRCSHTSGDKTACPEQQNMRNCQHDYKHDKLSPASDVYEISPYATFAVGGDTAATLDHTLQFRTFGHRENDAPPHRPCRKHPQRHRERTDGGGGSTGGGGSVGGFSEVYAGDSSVESGPGSLSPRTHHEHS-