Monarch geneset OGS2.0

DPOGS206478
TranscriptDPOGS206478-TA3342 bp
ProteinDPOGS206478-PA1113 aa
Genomic positionDPSCF300070 + 545677-563193
RNAseq coverage24x (Rank: top 78%)
Annotation
HeliconiusHMEL0127140.091.56% 
BombyxBGIBMGA005365-TA0.087.40% 
DrosophilaCG32432-PA2e-10530.88% 
EBI UniRef50UniRef50_E2AVC00.050.60%Cubilin n=5 Tax=Formicidae RepID=E2AVC0_CAMFO
NCBI RefSeqXP_968153.10.046.79%PREDICTED: similar to CG32432 CG32432-PA [Tribolium castaneum]
NCBI nr blastpgi|3800156990.052.81%PREDICTED: uncharacterized protein LOC100869581 [Apis florea]
NCBI nr blastxgi|3800156990.052.96%PREDICTED: uncharacterized protein LOC100869581 [Apis florea]
Group
Gene OntologyGO:00055151.2e-09protein binding
KEGG pathway 
InterPro domain[255-400] IPR0008594e-17CUB
[63-101] IPR0021721.2e-09Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL10279 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206478-TA
ATGGATTTGTTAAGAAATATCTATTGTATCTCAAATTATAATAGAACTATATTTTTCATGATAGTTTTGTTTTCTTTCAAACTGTGTAGACCTCAGGACTTAAGTGAAGACTTTATTAATGAGGATAGTTTAGACTCTGATATTATGTTACCGGTGAAAGTTAAGGCGAGAAATGTGGCAGAGAGTCCGTGTAGACTGAGCGAACTGCTATGTGATACGGGACAGTGTATATCGATGGATAAGTACTGTAATAGAGAAGATGACTGCGGAGATAAAAGCGATGAACCAAAATCGTGTACACCTTGTAACAGAACATACATGGGAGATGTCGGCCGTACTTACGAACTGGAGGTTCGTAGGCCGAGAGAGGATCATCTTCCATTTGTCTGTCATCTAAATTTTACTGCTAATGGCGGCAACTATGGCGATATTATACAGCTTACATTCGACACTTTCACGGTGGGTAAATTTGTATCGTTTACTTCTGATGGATGTCCGGATGGACACATGACAATAGTTGAGAGAAGTTCATCACCACCGATGGGTCAATGGTGTGGCTCAGCATGGGGTTATACTGTATACTTTAGCGAATCAGACTCTATAAATATGACTCTCCGTTTGGATAGACTGAGCCAGCAGGGTGTTGGGTACAATTTCGACTTCAAGTTGGCATACAAGTTTCTAAGACGAAGCGAAGCTAGATTGCGGTACGGTAACGCGACTGTGGGGGCATGGAGGGGGGAACGAGTCTCTGGCACCTATTGCGACCGCATACTGAGCGACTGTGATCTACGCGCATGCCGCATCCAGTCACCAAACTTTCCTGGAGTATATCCACGTAATGCGACGTGTACATACCGCATTGAACACACAAAGATACCAGCAGATAAACATGTTCTTTTGGCCGTAAGACAAACAAATAGTCACAAAATACATATCAAAGATCAAATAGTCAAATATGATAGAAGCCAACGAGTTTTAAAGATTTGGGACCAATGTAATGTCGTTCAGGATTATCTAACAGTTTGGGATGGACCTACAAGAGACTATCCGGTTCTAGTTAGACTTTGTGGAGGAGACGCAGTTCCTGACATAGTTAGCAGAGGGCCTAATATGTTACTAGAGTTTCATACCTCACCTTATGATAATCCTTTTCATCCAGTTCCGCTCAGTTATTTACCTGGTTTTGAGCTTGAAGTTCAGGTGTTGTACGTGGACAGAGATTCCCATTCGTACGTGAGTTCAGATGGTCGTTGTCGGTTTGTTTTACGCTCCTCTGACAAGACAAGCGGGGTGTTAAGAAATCCACGACACTCCTTGCCACCAAATACATCTTGTGTGTATTACTTTCAAGGTCGTCCAAACGAAATAGTATGGGTATCGTTTGTTAAGTACCACGCTGCGGGTTCGGAGCCGGCGGGATTCGATCAACAAAAGGATTGCTCTTCACAACTCACTATTTGGGATGGTGCGGCACCTGATGCAGATCTTGATAGAAAGTTGGAAATGAGTGACAAGAAATCTCTTCTCGGTTCATTTTGTCGAGAGGAATCTCCGCGTCTGTGTGACCATGCGCTTCTATCAAACGCTACGCGCGCCACCAGGCCTTGTGCTCCCACAGAAAGCTACATCACAACAGGACCAGCGCTTACTATTTTGCAGGAGCTACGTCAAGGTTCAGCATTATATCCTGTCTCTTTCGTCTTGCGATACGAATTCGTAGATGTGAGTGAGCAAGGTCAACCATTAGTGGACTCTCAGTCAGCGTGTGATAGAGTTTTTAAGTCAGCACTCACATATTCTGGAAGATTTCAAGCTCCCCGAGCTATATTTTACTATGGGCGAGGTGGATCTCAAAATTTGACTTGTATTTTAAGATTTGAAGCTAAGCATGGAGAAAGAATACAGTTAACGTTTACAAACACATATTTTGGAAATAAAATTTGTAGTACTCATAAAGATTCCAAGACAAGCCGATGGGTTTGTGATAGGCCAATTAAAAGAATAATAGGTGGGGAAGGTTTAGCTCAGATTATTATAACTGAATATCCTTGGGAAGGGATCCCTATACAAAGAGATTGTTTATGTACAAATCGCTCCGAACCTCTTAACGTTCACACGCTTACAGCTCCCGTCGTAGAAGTTAATTTTACTGTGACTATGATGAACATAACTGAAGATTATGACGATTTTCAATTTGAAGGTGAATATAAATTTATTCCCACTGGCCCCGGAGACGAAAGTGTTTGTTCTACTGGTTGGGGTGATAGACGATTAAGAGGAAGCAGTGGGGAAATAAGATTGTATGACAACAGAGAAGCGTCTGTAACTCCAGAAATAATGGGTGACAGAAATGTTATATCAGAAAGCGTCAGAGCAGAAGTTGCTTGTGTTCATAGACCTTGGTTGATAGAACCTGGAGGGGATGATGTAACACCTTTACAGGGTAGATATCTATATATCAAAGTACCGGGATATGAAATAACGCCTACCTCACCATTTTGTCCTACACCAAATCGACTATTTATATACGAAGCACGAGATACTTCAATTCACAGAGAAATTTGTCCTAAAGATTCTAACACTTTAGATTTGTATTCGCCAGGATGGAAATCATCACAAACAACAATAGAATCATCATTAAAACCTCATGCTAAGAGCTACGTAATTGATTTTCTACAACACGAACAAACCGATTACTCAATCAAATGGATAGAAGTTATGAAAAAGCCATCTATTGACCATGATCCAGGGTCAAACATTTTGCCTTTATCGGTTAATTTAGATTGCCATCATAACTGCCCAGAGTTGAATGCCTGTATTCCTATTACATTATGGTGTGATGGCAGTCCTCATTGCCCTTCTGGTTACGATGAAGATGACTCAAACTGTTCTTTTAAGTTGTCCTTGCCGTCACCATATGTTGCGGCAGTGGCCGGTATGGGACTTCTTATTTGTGCTATCGCAATTGGTTTATGTGCATGTAAACGACGAAGAAAAAAAGATAAGGAGTTTAAGGCAAGACTCGACGACGCGCTTCCACCTGAAGAAAGACCTTATGATCGTTCAAAAAGTAATGGCGTTCCAGAAGCTAATCGTCAATACGCAACTGTACAAAAATATGCTACAATAGATAAATACAGTTTAAGTCAAAAATATAGTGCGGGGCTCAATGATGTTAGGTATTATGATGAAGTAGCTCAAAAAGATAAGCTAGCCGACACGAGGTATGCTAGCTTAGGGCGCGCTGGTCGGTGTAATCGAATGGAAAATAATAGAGGCACTGGATCTAGAAGAATGCCAGACGTAGGCTATCCAGATCTAAAAGACGGATTTTGTTGA

Protein sequence:

>DPOGS206478-PA
MDLLRNIYCISNYNRTIFFMIVLFSFKLCRPQDLSEDFINEDSLDSDIMLPVKVKARNVAESPCRLSELLCDTGQCISMDKYCNREDDCGDKSDEPKSCTPCNRTYMGDVGRTYELEVRRPREDHLPFVCHLNFTANGGNYGDIIQLTFDTFTVGKFVSFTSDGCPDGHMTIVERSSSPPMGQWCGSAWGYTVYFSESDSINMTLRLDRLSQQGVGYNFDFKLAYKFLRRSEARLRYGNATVGAWRGERVSGTYCDRILSDCDLRACRIQSPNFPGVYPRNATCTYRIEHTKIPADKHVLLAVRQTNSHKIHIKDQIVKYDRSQRVLKIWDQCNVVQDYLTVWDGPTRDYPVLVRLCGGDAVPDIVSRGPNMLLEFHTSPYDNPFHPVPLSYLPGFELEVQVLYVDRDSHSYVSSDGRCRFVLRSSDKTSGVLRNPRHSLPPNTSCVYYFQGRPNEIVWVSFVKYHAAGSEPAGFDQQKDCSSQLTIWDGAAPDADLDRKLEMSDKKSLLGSFCREESPRLCDHALLSNATRATRPCAPTESYITTGPALTILQELRQGSALYPVSFVLRYEFVDVSEQGQPLVDSQSACDRVFKSALTYSGRFQAPRAIFYYGRGGSQNLTCILRFEAKHGERIQLTFTNTYFGNKICSTHKDSKTSRWVCDRPIKRIIGGEGLAQIIITEYPWEGIPIQRDCLCTNRSEPLNVHTLTAPVVEVNFTVTMMNITEDYDDFQFEGEYKFIPTGPGDESVCSTGWGDRRLRGSSGEIRLYDNREASVTPEIMGDRNVISESVRAEVACVHRPWLIEPGGDDVTPLQGRYLYIKVPGYEITPTSPFCPTPNRLFIYEARDTSIHREICPKDSNTLDLYSPGWKSSQTTIESSLKPHAKSYVIDFLQHEQTDYSIKWIEVMKKPSIDHDPGSNILPLSVNLDCHHNCPELNACIPITLWCDGSPHCPSGYDEDDSNCSFKLSLPSPYVAAVAGMGLLICAIAIGLCACKRRRKKDKEFKARLDDALPPEERPYDRSKSNGVPEANRQYATVQKYATIDKYSLSQKYSAGLNDVRYYDEVAQKDKLADTRYASLGRAGRCNRMENNRGTGSRRMPDVGYPDLKDGFC-