Monarch geneset OGS2.0

DPOGS203872
TranscriptDPOGS203872-TA2445 bp
ProteinDPOGS203872-PA814 aa
Genomic positionDPSCF300398 + 79939-106183
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0059893e-11659.77% 
BombyxBGIBMGA013290-TA1e-4641.01% 
Drosophilakirre-PA3e-16341.69% 
EBI UniRef50UniRef50_Q17HA93e-16443.24%Nephrin n=3 Tax=Endopterygota RepID=Q17HA9_AEDAE
NCBI RefSeqXP_975294.10.049.32%PREDICTED: similar to nephrin [Tribolium castaneum]
NCBI nr blastpgi|910790920.049.32%PREDICTED: similar to nephrin [Tribolium castaneum]
NCBI nr blastxgi|910790920.047.53%PREDICTED: similar to nephrin [Tribolium castaneum]
Group
KEGG pathwayecb:1000538444e-17 
 K12567 (TTN)maps-> Dilated cardiomyopathy
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[196-290] IPR0137834e-15Immunoglobulin-like fold
[188-274] IPR0131621.2e-14CD80-like, immunoglobulin C2-set
[85-180] IPR0035994.5e-11Immunoglobulin subtype
[80-162] IPR0130982e-08Immunoglobulin I-set
Orthology groupMCL10226 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203872-TA
ATGGCAGCAACAGATACAGCCGGTGTTAATCTCAATAAGGACAAGATTGATGATCTTGAAGAGAAGATAACAGAGATTGTAATCAGAGGAATGAAATTCAGTACGTCAAGGGTAACGAGTAGGAAGGCAAATCCATTGTTCCTTTTGCTGCGATTATTGATTAGCTATATCACAATAGCCAAGCATTGTATGGCGTTGATCTTATTAATGCACGCAGTGAATGCGTACAGTCCGCAGAAATTTGCTATCGAACCGCAGGATCAGAGTGCGGTGCTCGGGTCAAGGGTAACTTTGCCGTGTAGAGTTATTAACAAAGCTGGTCAACTTCAGTGGACAAAAGATGACTTTGGTCTTGGAACTCACCGCCATTTGACAGGATATGAGAGGTACAAAATGGTTGGGAGTGACGAGGAAGGAGACTATTCATTGGATATCAGTGACGTCACTATTGATGATGACGCCAACTACCAGTGCCAAGTCAGCACGGGACCGAAAGGGGAATTAGCTATACGGTCGAAATATGCTCGTCTTACCGTGTTAGTTCCTCCCGAACCACCGAAAATTTTGAAAGGACCTACGATAGAGGCTGTTGAAGATAGGGAAATTATGTTGGAATGTGTATCAGTGGGGGGAAAGCCAGCAGCTGAAATAACCTGGGTTGATAGTGAGGGTAATGTCCTTAACCAGGGAGTTACTTACACTATAGAACAGATGTCAGACGGCAGGAGGTTTATTGCTAGGTCCGTCTTGCACCTTCGTCCTCGCAGACAGCACCACTCCCACACGTATACTTGCCAGGCGCAAAATACCGCAGACAGAGCGTATAAGGCCGCCAGCGTCGTACTAAAGGTACAATATGCACCTAAAGTAAGAGTTGTCGTGAGATCAAGACCAAACGGTAAAATACAGGAAGGAGACACTTTAGTCGTAGGATGTCAAGCCACTGCTAACCCAAGTAACATAATTACGAATGTCTCTCGAAAGCACAACGAAGCTACAATCAAATGCGAGGTTCAGAATGAAGTTGGCAGCAGTGCCGATTCTAAATCACTAGACGTATCCTATGGTCCAGCGTTTAAAGTTAAACCGCGAAACGCGGAAGGTGACATCGGATCCGTGGTAACTCTAACATGTTCGATAGAAGGTCACCCGCAACCAAAACTTCTGTGGTTAAGATACCAACAAGATAGAGTCATTAGGGTAGGTAAATCGTCGAATTTAACAATAACAATAAACAAAGAAACGGCTGGTCAATATTGGTGCAGAGCGAGCGTAGAAGGCTATCCAGATATAGAGTCACCTGCTATGGTCTATGTCAAAGGTCCTCCAAGAATTATTTCTAATCAAACTCAATATGGCGTAGAAGGTGGTAGTGTGAGGATCGAGTGTGTAGCTTTTTCTGTCCCCAAACCTGATTACATTATATGGTCATTTGGTGGCAGTGAAATAAACTCATTTCACAATCATGAATATGCGTTTCTTGAAGAGTCTCTACCAGATGGTTTAACCAAATCTTCTCTTGTTATAAGAGAGAGCGAAACAAAACACTTTGGCAGCTACAATTGTAGCGTTTCTAATGCCTACGGACTTGACAGTCTGGAAATACACCTAATACCAGACAAAACGATGCCTTTAATAATATTTGTTATCGGTGGCTCGGGAGTAACGATACTTATTTTGATCATTATGTTAATTGTGATGCTATGTCACAAGAACTCGAATAAGAAAGAAAAGGACGAAAACGTTACAGAAATATCAAAAGAAGATAAATTCAAAGATGGTGACAGTTCGAACATAAGTGACTTAAAACTGGAATTAAGACAAGTAGAAGTAAACTGTGAAATGGATCCATCAACCGGGCCTGATTTGGACATGCGTTCAGCTCTTCAACTAACCAGCAATCTTGGGTTACCGCTGGCAGGAGCTGTGCACGATCACGTGTATAGGTACAGTGACGAATTCAATGTTCATGGATTTAAGAATCACGATCAAACTAAAGGCTACGTCCCATACGTCGACTACTCTAGAGATTACGCTCCTCCCACTAATGATTCGATGACTGGATCTTTATCCAGAAGCACTGATGAGTCCACTTATCAGAGTCACTGCGGATCTTTGAATCGCCAAGAGAGCTGTGGCAGACTAGGAGGATTGGTAGGACCGGATGTAATACCGATGGCAAATTCAGGAGTGGTTATGACAGGAGTCGACGTAAGATATGCGGCGACTTATGGCAATCCTTATTTGAGAAGCAATGGAGTTGGTTACGTTCCACCAGTTGCTAATTCTTCGAAAAATGCACCACCTCCTTATTACACGCTGCGAAATACAAATCAACAACCCAGTCCGTCTATGTCTTCGTCAATAACGAATTCTTTTACCAGCCAGATCACATCTTTACCTAATAATGCTCAACCACAAGTTCCGGAACTCACGTGTAAATGA

Protein sequence:

>DPOGS203872-PA
MAATDTAGVNLNKDKIDDLEEKITEIVIRGMKFSTSRVTSRKANPLFLLLRLLISYITIAKHCMALILLMHAVNAYSPQKFAIEPQDQSAVLGSRVTLPCRVINKAGQLQWTKDDFGLGTHRHLTGYERYKMVGSDEEGDYSLDISDVTIDDDANYQCQVSTGPKGELAIRSKYARLTVLVPPEPPKILKGPTIEAVEDREIMLECVSVGGKPAAEITWVDSEGNVLNQGVTYTIEQMSDGRRFIARSVLHLRPRRQHHSHTYTCQAQNTADRAYKAASVVLKVQYAPKVRVVVRSRPNGKIQEGDTLVVGCQATANPSNIITNVSRKHNEATIKCEVQNEVGSSADSKSLDVSYGPAFKVKPRNAEGDIGSVVTLTCSIEGHPQPKLLWLRYQQDRVIRVGKSSNLTITINKETAGQYWCRASVEGYPDIESPAMVYVKGPPRIISNQTQYGVEGGSVRIECVAFSVPKPDYIIWSFGGSEINSFHNHEYAFLEESLPDGLTKSSLVIRESETKHFGSYNCSVSNAYGLDSLEIHLIPDKTMPLIIFVIGGSGVTILILIIMLIVMLCHKNSNKKEKDENVTEISKEDKFKDGDSSNISDLKLELRQVEVNCEMDPSTGPDLDMRSALQLTSNLGLPLAGAVHDHVYRYSDEFNVHGFKNHDQTKGYVPYVDYSRDYAPPTNDSMTGSLSRSTDESTYQSHCGSLNRQESCGRLGGLVGPDVIPMANSGVVMTGVDVRYAATYGNPYLRSNGVGYVPPVANSSKNAPPPYYTLRNTNQQPSPSMSSSITNSFTSQITSLPNNAQPQVPELTCK-