Monarch geneset OGS2.0

DPOGS215248
Transcript: DPOGS215248-TA, 3558 bp
Protein: DPOGS215248-PA, 1185 aa
Genomic position: DPSCF300047, 434023-445885
RNAseq coverage: 814x (Rank: top 16%)
Annotation
Heliconius: HMEL013991, E-value 6e-78, identity 44.31%
Bombyx: BGIBMGA008812-TA, E-value 4e-88, identity 59.34%
Drosophila: sec31-PB, E-value 3e-156, identity 36.28%
EBI UniRef50: UniRef50_E0VNA3, E-value 0.0, identity 34.38%, Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VNA3_PEDHC
NCBI RefSeq: XP_973673.1, E-value 0.0, identity 37.90%, PREDICTED: similar to vesicle associated protein, putative [Tribolium castaneum]
NCBI nr blastp: gi|91087995, E-value 0.0, identity 37.90%, PREDICTED: similar to vesicle associated protein, putative [Tribolium castaneum]
NCBI nr blastx: gi|91087995, E-value 0.0, identity 38.34%, PREDICTED: similar to vesicle associated protein, putative [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, E-value 1.8e-43, protein binding
KEGG pathway: tca:662488, E-value 0.0
    K14005 (SEC31), maps to: Protein processing in endoplasmic reticulum
InterPro domain: [10-325] IPR015943, E-value 1.8e-43, WD40/YVTN repeat-like-containing domain
    [12-330] IPR011046, E-value 1.5e-40, WD40 repeat-like-containing domain
    [239-279] IPR001680, E-value 1.3e-09, WD40 repeat
    [242-279] IPR019781, E-value 1.4e-08, WD40 repeat, subgroup
Orthology group: MCL11293, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215248-TA
ATGAAGATCAAAGAGTTAAAACGGACGGTGAACATGAGCTGGTCGCCAGCGGAGCTCTATCCCTCTATGTTGGTTACCGGCTCTGCGGCACAGCAAGTTGACGCTTCCTTCAGCTCTAATGCTAGCTTAGAGTTATATTCTCTTAATCTTGGTGACCCTACCTACGATTTGGAACTTAAATCTAGTATGCAAACAGAACATAAATTTCAGAAATTAGTGTGGTCGGGCGCTGGAGTGATTGTTGGTGGATGTGATGGTGGACTGTTGGAGTTTTATAATGCGGAGAAACTTCTCAAGAACTCCTCAGAAGCCTTTGTTGGTAGCAGTACTAAACATACAGGCCAAGTGTCGGCGCTGGATATTAACCCCTACCAGAAGAACCTGTTGGCTTCCGGAGCCTCTGACAGTGAGATCTTCATCTGGGACCTCAACAACACCAGCCAGCCCATGGCGCCGGGCGCGAGGAGCGCGCCACACGATCATGTTCAGGGTCTAGCGTGGAACCAACAGGTCCAGCACATTCTGGGTTCGACTTTCGCAACACGATGTCTCGTTTGGGATTTGAGGAAGAATGAACCGATAATGAAACTCAGCGACTCCCAGTCCGGTAGTCGCTGGCGCGCCCTGGCCTGGCACCCTCAGGTCGCCACTCAGCTGTGCGTGGCGTCCGACCACGACCACGCGCCCGTCTTACAGCTGTGGGATCTCAGGTTGGCGGCGTCTCCGCTGGTGACGCTGGAGGGCCACGAGAAGGGCGTGCTGTCCTTGAGCTGGAGCAAACACGACGAGGACCTGCTGCTGTCCGCCGGCAAGGACGGGAGCGTGCGCGTCTGGAACCCTGCCAACACCAAGCCGGGAGGGGAGATGGTCCTGGAGGTGTGCCGTCAGTCGGGCTGGGTGCTGGACGTGTCGTGGTCGCCGCGGACGCCCGGCCTCCTCGCCGCCGCCTCCTTCGACCAGACCCTTTCAATCAACGAACACTCCAATTTTCCTGGTCCATTTCAGAGCGCTCAGGGTCAGAGCGACATAATGGATTCTTTCGGTGGGGCGGAGTCGTTCCTGTCCCTGCCGGTGGTGAGTCCGTCCGCCCCGCCGCCCGCCCCCGCCCCGCAGGCTCACAGACCACCCAGATGGTTGAAGCGACCCGTCCGGGCCAGGTTCGCGTTCGGTGGTAAGTTGGTGTCGTTCGAGCGTTGCCGGCGCGAGGCGGGCGCGCAGGAGACGGTCTACATCAGTCAGGTGGTGAGCGAGCCGGAGATCGTGGAGAAGGCGATGGAGCTGGACAAGGTCATCGGCCTCACGCTCAGCCAGGAACCCGACGCGACACACCGGCTGGCCGAATATTGCCGTGAGAAGGGAGACGCGGCCGTGGAGCAGAGCGAGCGCTACGAGTGGTTCTTCCTACGAGCGAACTTCCTGCCATCATACCGGACGGAACTGCTGAATCTGTTAGGGTTCAAACAGGACGAGATATCGTCGAGGTTCAAAGGCCTAGCCGTTAACAACGAGGGCCGAGCAGCGGACGCCCAGGGACTCAGCAGGGACGCGCAGACCTTGATAGAGAGGAAGCTGTCTAGTGTGGAGCTGGAGCCGAGCGTGCGGGACGTGGTCATCCCCAACGGAGACGACCTCACGAGTGAGATCTGTCGCTCGCTGGTGATGGGTCAGCTGGAGGAGGCGGTGGAGCTGTGTCTGGAGGACGAGCGGGTCGCCGACGCACTCGTCATCGCCTCGCTCGGAAGCCAAGAGTTGTTGTACAAGGTCCAGCGGTACCATCTGTATCGCACGTGTAACTCGCCCGTGTCGCTGGTGGCGGGCGGTCTGCTAGGCGGGCGTTGGGCGGCGCTGGTGGCGGGCGCCTCTCCCTCGTCCTGGAGGGACGTGCTGGCCGCCCTCCTCACACACTGCCACGGAGAGAGCTTGCAACACTACTGCGAGATGTTGGGTGACAAGCTATCATCGTCATCCGAGGCGAGTCTTCGGGAGGCGGCCGTACTGTG
TTACACGTGTTGTGCGTGCGGGGAGCCGCTGTGTAGACGAGCGCTACGATCCAGCCGGAGCCCCGCAGACCTGGCAGCCGCCGCGGAGCGAGCGCTGTTGTTGCGTCGGGCAGCCGCGGTCAACGGTTCCGCGGCCGGCGGGTCCGCGGTGCCCGGGGGGTCCTCTGCCGTGGACGTGTTGCTGGAGGAGTACGCAGCGAGGCTCGCGGCTCAGGGCTGCCTGAGGAGCGCCCTGGCCGCACTGCAGGGAGCCAACACTACGCTCAATGATAGACTGGAGGTGGCTCTAAGGATGAAGAATCAATATCACAATCGTCAGCAGCCGTCCGGCTCCCAGCAGACCGGCCCTCGCTCCCGGACCGTCAGCGGACACACGCACACACAGCCGCGCGGACAGTACACACACACATACACACACAACGACAGCCACGCCTACAACCAACGTGGAATATCCGGGAGATCTAAATACAAAGTGGATCCGTCAGTGCAGGCGGCTCCTCTCTACAACCAGTATAGCTTCAACAACCCCGCTCCCCCGCAGCCGTCATACGGCTATAACAGTCCTCTGCCGGAACAGTACGGGTCCCCCGCGCCCGTCAATAACTTCGCTCCCATCAACAGACCTAACCCGGTCCCGTTAAACCCGGCTCCACTAAACCCGGCTCCACTAAACCCAGCTCCACTAAACCCAGCCCCGCTCAGCCAGCCTCAGCCTGAGCCCATGTCCATGTCTCAGTACGCTCCCCCCCGGCCCGCCGCCCCCGGCTGGAACGACCCCCCGATGGTCACCAACACGCATAAGTTCGATGTCGGCGACATGTCGGCCTCCCACGCCGGCACCAGGCACTGGACACAGCTCCTACAGAGTAGACGGTCGGATGAGCCAAAGCAAGAAGTTCAGCAGCAGGCGCCCATAACACATCCGTTGTTCGGAGTGGAGCCTCCGCAGCACGTGCCTCTGGTGCCGGCCCCGGGACAGAATCACTACCAGCCACAGAACCAGTTCCCTCCGGCCCAGTACCCCGGACAGTTCCAAGGACAGTATCAAGGGCAACCGGCTCACTTCCCGGGACAGCAGGACCAATTCTCGGGACAACAACCTCCTGACCAGTACCAGCCGCAGTACCCTGGCGGATATCAACAGAACTACGTGCAGCAGCTGCCTCCTCAGGCCGCCGCCCCCCCGGCCTCGGGTCCCCCGGCGCCGCCAGCCCCGGTACCCAAGCCCCCGCTGTCGGCGGACCACGCCCCCATACAGACCGCCTTCGACGAACTCCACCGCGTGTGTCTCGAGCGAGCACACAACACACAAATCAAGAGGAAGCTGGAGGACGTGCAGCGGCGGCTGGAGACGCTCTACGATATATTACGCGAGAACAAGCTGTCCCCGTCAGCGCTGTCGGCGCTTCACACGAGCGCAGCGCTGGCCGGTCGCGGGGAGGCGTCCAGCGCGCTGCAGGCGTGCTCGGAGCTGGCGGCCGGCAGTGACTTCGCCGCGGCCGCGTCCTTCCTCCCTGGATTGAAGATGTTGTTCCTCCTGGCGGACCAGCTGCGGTAG

Protein sequence:

>DPOGS215248-PA
MKIKELKRTVNMSWSPAELYPSMLVTGSAAQQVDASFSSNASLELYSLNLGDPTYDLELKSSMQTEHKFQKLVWSGAGVIVGGCDGGLLEFYNAEKLLKNSSEAFVGSSTKHTGQVSALDINPYQKNLLASGASDSEIFIWDLNNTSQPMAPGARSAPHDHVQGLAWNQQVQHILGSTFATRCLVWDLRKNEPIMKLSDSQSGSRWRALAWHPQVATQLCVASDHDHAPVLQLWDLRLAASPLVTLEGHEKGVLSLSWSKHDEDLLLSAGKDGSVRVWNPANTKPGGEMVLEVCRQSGWVLDVSWSPRTPGLLAAASFDQTLSINEHSNFPGPFQSAQGQSDIMDSFGGAESFLSLPVVSPSAPPPAPAPQAHRPPRWLKRPVRARFAFGGKLVSFERCRREAGAQETVYISQVVSEPEIVEKAMELDKVIGLTLSQEPDATHRLAEYCREKGDAAVEQSERYEWFFLRANFLPSYRTELLNLLGFKQDEISSRFKGLAVNNEGRAADAQGLSRDAQTLIERKLSSVELEPSVRDVVIPNGDDLTSEICRSLVMGQLEEAVELCLEDERVADALVIASLGSQELLYKVQRYHLYRTCNSPVSLVAGGLLGGRWAALVAGASPSSWRDVLAALLTHCHGESLQHYCEMLGDKLSSSSEASLREAAVLCYTCCACGEPLCRRALRSSRSPADLAAAAERALLLRRAAAVNGSAAGGSAVPGGSSAVDVLLEEYAARLAAQGCLRSALAALQGANTTLNDRLEVALRMKNQYHNRQQPSGSQQTGPRSRTVSGHTHTQPRGQYTHTYTHNDSHAYNQRGISGRSKYKVDPSVQAAPLYNQYSFNNPAPPQPSYGYNSPLPEQYGSPAPVNNFAPINRPNPVPLNPAPLNPAPLNPAPLNPAPLSQPQPEPMSMSQYAPPRPAAPGWNDPPMVTNTHKFDVGDMSASHAGTRHWTQLLQSRRSDEPKQEVQQQAPITHPLFGVEPPQHVPLVPAPGQNHYQPQNQFPPAQYPGQFQGQYQGQPAHFPGQQDQFSGQQPPDQYQPQYPGGYQQNYVQQLPPQAAAPPASGPPAPPAPVPKPPLSADHAPIQTAFDELHRVCLERAHNTQIKRKLEDVQRRLETLYDILRENKLSPSALSALHTSAALAGRGEASSALQACSELAAGSDFAAAASFLPGLKMLFLLADQLR-