Monarch geneset OGS2.0

DPOGS200046
TranscriptDPOGS200046-TA2844 bp
ProteinDPOGS200046-PA947 aa
Genomic positionDPSCF300365 - 2701-55402
RNAseq coverage59x (Rank: top 68%)
Annotation
HeliconiusHMEL0073690.068.01% 
BombyxBGIBMGA013907-TA0.060.52% 
DrosophilaSP2353-PA2e-5645.19% 
EBI UniRef50UniRef50_F6YZ481e-5627.77%Uncharacterized protein (Fragment) n=2 Tax=Ciona intestinalis RepID=F6YZ48_CIOIN
NCBI RefSeqXP_393275.31e-7639.77%PREDICTED: similar to SP2353 CG8403-PA [Apis mellifera]
NCBI nr blastpgi|3838527666e-8040.14%PREDICTED: pikachurin-like [Megachile rotundata]
NCBI nr blastxgi|3504005778e-8041.34%PREDICTED: pikachurin-like [Bombus impatiens]
Group
KEGG pathwaybta:4448726e-56 
 K06255 (HSPG2)maps-> ECM-receptor interaction
InterPro domain[224-403] IPR0089851.2e-39Concanavalin A-like lectin/glucanase
[491-706] IPR0133206e-37Concanavalin A-like lectin/glucanase, subgroup
[250-386] IPR0017911e-27Laminin G domain
[552-684] IPR0126797.6e-21Laminin G, subdomain 1
[259-385] IPR0126807.4e-20Laminin G, subdomain 2
Orthology groupMCL11712 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200046-TA
ATGGCTGGTGATCTAGCGGTGGTAGCGCTGCTGGCGCTGGTCGCAACGGTCACCTCGCGCGCGCCTACCACCGACCCGCTACTGTTCGAGGCGGCTTTTCAAGGAAGGTGTGGTGAAGGCAGTCCGTGCGAACAGCTGTGCAGAGAACTTCATGACGGGACGTATGAGTGCGGCTGCGGGCCAGGGTTTGTGCTGCATGTAGACGGCTATGGCTGTGTCGAACTGAATTCAACGAAGGCAACGGAGAGCTCATCCGACAAGCAAGAAGATGTGCTTTACCAGAAAGACGTTTCTTTCTCGGCTGAACTAGAAATAGCCCCGTCCAATAGATTTAATAAGAATAATGTAAGCGAAATACATATCGATCAAAGACTAGCAACACTCCCCACTATCCTTCAGACCAAACGAAATTACACGTATACGGGAGAGATTTACAAAGAAGTTGACGGTTTAAACAGTCTCATAGAGGACGGAGTTAGCGAAATAGATTTAAACATCATAGCTGAGAGAGACAATGTGTTCGCTGATAGGAGAGATAAGGTTACTACCTTGGGACCCAGCTTGGAACAAAAGGTGCCAGGCGGTACTTCCTCGCCTTCCAGAACAACAGCAGTTGAGAGCTGCAGTTGCGATTGTACTGAGATAAAATGCGTCTGCCCTGGGAAATGTAAGCCAGCTACGCAAATAGCGACGCCACGTTTTTCGGGTTCGTCATGGCTAGCGCTCCGTGCTTTAAGGGGCGCTTACAAAAGGGTGCGATTAAGAATGAGGGTGCGGCCAGAACGCACCCGTGGAGTGTTATTGCTGACGGGTGAACACGACGATCTATCAGGGGATTACCTGGCGTTGATACTAAGGAATGGACATGTTGAATTAAGATTCGATTGTGGTAGCGGCGCAGGTGTGCTGCGGTCTCCTGAACCAGTTCATCTTGGTAGATGGAACACCATATCTGTTTATCGACACAGATGGGACGCCTGGCTCAAACTGAATAATGGGAAACGAGTGCGCGGGAGATCGAAAGGCCTCTTCTCGCGGATGACCTTCCGGGAGCCTGTGTGGGTCGGAGGTGCTGGTAACACTACTGGTTTGCAGAACAAGCTCGGCCTCTCCGAAGGATTGCTCGGCTGCGTCGACTTCTTGAGAATTAACGGTGACAGTTACCGTCTAGTGAAAGATGCTGTCTCCACACTTGATATCGCTGAATGTGCACCAACCCTCGCGCCGTGGAGACCGAGCGAAGTGACCAGTGAAATACCAAGGAGACCTAAAACCGAATCACATGTATATGACATTAACGATATAGTACACTTTGAGGACAACGACATAGTTATGAGAGACGCCAAGGACCACAATAAACTGGACAATTCCATACATTACAACGTTAAAATGTTCAATGACAAATACGACTTGATTGATGATAAGAAACTCAATGACATTGATAATGAACTGAGTGTTCAGGACTGTAAATGTGAGCATGGGGGAGGTTGTGTGGAACATGGCTGTCTCTGTCCTCTTGGATATGCGGGAGAGAGATGCGAAATCACTTTGGACCTAAAGGTGCCACGCTTCAACGGTTCGTCGTACTTACGATTGCCCGGCCTGGGAAACACAGCGCAATCTTGGCTGGATATTCGGATAACCGTAAAACCGACTAGCGGTGATGGGCTTCTGTTATATGACGCGGAACACCCCAGCGGCGATGGTGACTTCTTCTCCCTTCATCTTCGTGATTTTTTCGTCGAATTCGCCTTCGATCTTGGATCTGGGATCGCTCTTGTGAGATCCGCCTACCCGCTGTCACCAAACAAGTGGCATAGCATATCAATAAGCCGTACGGGTCGTCACGCGTCGATCCGCGTTAGATCTTACGACACGAGCGACGTGACGGATACGACGCGATCGGTGACGTCACGCGGCGCCGCGAGGAGACTCACCCTCACCCAGCCGATGTTGCTAGGAGGCGCGCCCTATCCGTTGCCACAGAGACTCGCCTTGAAAACCTCCTTCAGTGGCTGCGTGGGCAAGTTGGTGATCAACGAGGAGGAGTTGTCTGTAGTTTCCGCTGCTCTCGGCGGCGTTGATGTCGACAACTGTGACGCGCCTCACAACACGTGCACGGACTGCAAAGAGACGTTATACCAAACACCAGAATACCCACGCGAGCTATCAGCCATCCACCACTCTATAGTCGCGAAAAAGGGATTCAAAATCAAAAATCACAAAACCAAAACTAAAAAACATCACGAGAAAAAGAAATATCCTAAGAAGTACGTACAAAATGGCGTGCATATGCAGAACGATATAGACAAGGGTGTTACGGAACAGCCTTACGACGGACGGACATACATGCAAGTGAAATACCTAGACTCCAACGAGATCAACTGGGGGGACACGAACACCTACCCGAGTTTCACGGGAACTGATAGCTTTATACATATAGATGACGAAGAGACTATGAAAAGGTTGCTGAGCTACACCCTGGACATCAACATCCGTTTCCGTTCCGTGTCCTCCAACGGTCTGTTAGTGTGGAGCGGTCGGGTCACACACACACACGCAGAGAACAATATGAACACGAACACACACACAAGCGACTTCCTTTCATTGGCTGTGGAAAACTCCGTGCTTGTATTCAGATACGATCTCGGCAGCGGCGAGGTGGTCATTATAGCGAACCACACGAAAGTGGACGACGGTTTGTGGCACAGAGCGAGAGCAACCAGGAACAGACAAGCAGGTGTTCTGGAAGTAGACGGCTTGGGGTCTGTTGGGAAAATATCACCTGGAAAACTGAAACAACTGAACACCGAGAACGGACTTTATATCGGTAGGTGTTGTTGA

Protein sequence:

>DPOGS200046-PA
MAGDLAVVALLALVATVTSRAPTTDPLLFEAAFQGRCGEGSPCEQLCRELHDGTYECGCGPGFVLHVDGYGCVELNSTKATESSSDKQEDVLYQKDVSFSAELEIAPSNRFNKNNVSEIHIDQRLATLPTILQTKRNYTYTGEIYKEVDGLNSLIEDGVSEIDLNIIAERDNVFADRRDKVTTLGPSLEQKVPGGTSSPSRTTAVESCSCDCTEIKCVCPGKCKPATQIATPRFSGSSWLALRALRGAYKRVRLRMRVRPERTRGVLLLTGEHDDLSGDYLALILRNGHVELRFDCGSGAGVLRSPEPVHLGRWNTISVYRHRWDAWLKLNNGKRVRGRSKGLFSRMTFREPVWVGGAGNTTGLQNKLGLSEGLLGCVDFLRINGDSYRLVKDAVSTLDIAECAPTLAPWRPSEVTSEIPRRPKTESHVYDINDIVHFEDNDIVMRDAKDHNKLDNSIHYNVKMFNDKYDLIDDKKLNDIDNELSVQDCKCEHGGGCVEHGCLCPLGYAGERCEITLDLKVPRFNGSSYLRLPGLGNTAQSWLDIRITVKPTSGDGLLLYDAEHPSGDGDFFSLHLRDFFVEFAFDLGSGIALVRSAYPLSPNKWHSISISRTGRHASIRVRSYDTSDVTDTTRSVTSRGAARRLTLTQPMLLGGAPYPLPQRLALKTSFSGCVGKLVINEEELSVVSAALGGVDVDNCDAPHNTCTDCKETLYQTPEYPRELSAIHHSIVAKKGFKIKNHKTKTKKHHEKKKYPKKYVQNGVHMQNDIDKGVTEQPYDGRTYMQVKYLDSNEINWGDTNTYPSFTGTDSFIHIDDEETMKRLLSYTLDINIRFRSVSSNGLLVWSGRVTHTHAENNMNTNTHTSDFLSLAVENSVLVFRYDLGSGEVVIIANHTKVDDGLWHRARATRNRQAGVLEVDGLGSVGKISPGKLKQLNTENGLYIGRCC-