Monarch geneset OGS2.0

DPOGS213644
TranscriptDPOGS213644-TA5292 bp
ProteinDPOGS213644-PA1763 aa
Genomic positionDPSCF300165 - 50628-56912
RNAseq coverage307x (Rank: top 37%)
Annotation
HeliconiusHMEL0045920.056.78% 
BombyxBGIBMGA004585-TA0.051.31% 
Drosophila% 
EBI UniRef50UniRef50_G3LSH90.051.50%Vitellogenin n=1 Tax=Cnaphalocrocis medinalis RepID=G3LSH9_9NEOP
NCBI RefSeqNP_001037309.10.051.14%vitellogenin precursor [Bombyx mori]
NCBI nr blastpgi|2848084760.054.40%vitellogenin [Actias selene]
NCBI nr blastxgi|2848084760.054.57%vitellogenin [Actias selene]
Group
Gene OntologyGO:00053195.4e-211lipid transporter activity
GO:00068695.4e-211lipid transport
KEGG pathway 
InterPro domain[29-721] IPR0017475.4e-211Lipid transport protein, N-terminal
[394-753] IPR0110301.5e-83Vitellinogen, superhelical
[29-306] IPR0158168.4e-50Vitellinogen, beta-sheet N-terminal
[28-321] IPR0158193.2e-49Lipid transport protein, beta-sheet shell
[778-1033] IPR0152552.9e-24Vitellinogen, open beta-sheet
[1433-1599] IPR0018462.1e-21von Willebrand factor, type D domain
Orthology groupMCL11031 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213644-TA
ATGAAGCTGTTGGTCCTAGCGGCCACTATTGTGGTCGTTTCATCCGGTCAACTTAGTGACGTTGTTGTGGAGTCGCCATGGCCCTGGCAAGTCGGAAAATTATATCGCTACGATGTGGAAACACATACCCTGGCACGTTATTTAGACAGTTTCAGTTCGGGCAACGCTTTCAGAGCCAAGTTTACAGTGCGAGCTAAGTCAGACGGCTACCTACAGGCACGGTTGGAGAATCCAGAATATGCCAAAGTCTACCAAAAGCTAGAACAACACGATCCCATGCCAGAAGATCTGAAATACGTACCTGTGGCAAATTTGGACAAACCGTTTGAGATTTACATAGAAGGTGGAAGAATACTTTCAGTAAAATTACCATTTTCTGTAACACTCATGCAAGAAAACTTGATTAAAGGTCTAATCGGTTCGCTTCAAGTAGATCTCACCAGTCATCGTAATGTAAAAAGCTCCCATGACACGTACGATACTCAAGTACAACAGGGATTATTCCGCAAGATGGAAATCGATGTGACGGGCGATTGTGAAACTCTTTACACAGTATCTCCTGCTGCGTCTGAATGGAGACGTGAACTTCCCAGTTTTGCTTCCGACGATGAACCAATTGAAATAACTAAAAGCAAGAACTACGGTCATTGTCATCATCGTGTAGATTATCACTTCGGTGTACCGGAAGGAGCAGAATGGTCTGGCACTGCCCACAAAACTGGAAAGGAACAATTCATAAATCGTGCTACGGTCTCAAGAATGCTGGTTGGGAAGAATGGTCACATATACAAAGCTGAAACAACCAGTACGGTTACTGCTCATCCCCATTTATATGGGGAACAAAAGGCACAGGTACACGGCAAAGTGCGTTTTAATTTAATGTCGTATGAGGATGATAATGAACCGGCATGGGTATACCCCGAAGGTGCGCGTGAAGTTACCAACTTATTATATGCTTTGACGGCAAAACCAATTGATATCGGTGATAGTTCGTCGTCTGAAAAGTCTATAAAAATTGAGAAACATCCGAGGCAACGTCGCTCCAGTCGTATGAAATCTTTCGTCTCCATAAATAAGAAGATTGTTACTGAAACACATGGATCTTCCAGTTCCAGTGAATCAGACTCAGTATATGTAAATGATGACATTCCCAATATCAACGAACCCGCCTATGCTTCTCTCTATATGAACCCAGATCTTCATGGTGATAAGAAACAGAATCCCATGAATGCTCAGAAGCTTTTACAAGAAATCGCCCAACAATTGCAAAATCCGAACAATATGCCGAAAGCGGATACCTTATCCAAATTTAATATTCTAGTTCGTGTCATCGCCAGTATGAGTTATGGACAGCTCGGTCTGACAAGCCGCAGCATTGAAATTGCTAAGTTGGCTAATGATGTCGTGAAGTCTAACATGTGGATGATCTACAGAGATGCTGTCGCCCAAGCCGGTACTCTGCCCGCATTCCAACAGATAAAGGCTTGGATTGAAAGCAAAAAATTAGAAGGAGAAGAGGCGGCGGAAGTTATTTCCGTGCTTGCAGTATCTCTAAGGTATCCCACGAAGGTGGTCATGAAACAATTCTTTGATCTCGCCATGAACCCCGAGGTAACTAAACAGATGTTCCTTAATGACACTGCACTAATCGCTGCTGCTAAATTAATAAACATGGGACAAGTAAACAATGAAACTGTGCATCGTTACTATCCGACACATATGTACGGACGTCCATCACCTAAGGAAGATGCCTTCGTGATTAATGAAATTCTTCCCCGTCTGAGTCAGGAGCTTCAACTGGCTATTGAAAATGGGGATAGTCGAAAATCACAAGTATATATTAAGGCTATCGGCGAACTTGGTCACCCAGCTATCCTGGATATATTTAAACCGTACCTTGAAGGCAAAATTCCGGCTTCAACTTATCTTAGAACCAGAATCATAGAACATCTCTATGTTCTGGCCAAAGGAAGGGATGATTATGTACGTGCTGTGTTATTTAGCGTTTTGAAAAATACTGCTGAACCATATGAAGTAAGAGTAGCAGCCATCGATAAAATCTTTATGTCACGACCAAGTACAGCGATGATGATGGCAATGGCACAAATGACTAAAGACGATCCTAGTATCCAAATCCGTGCAGCGCTTAAATCGGCAATTACATCTGCATCAGAACTTAAAAATCCAAGATTCCATGACCTGGCAAGAACAGCAGCAGCTGTTAAGGATATGCTCACAAGTGAAGAGTTTGGTTTACAATACTCTGGTAAAAACTTCCTGGAACACTACGACAGGGATGAGCAACCAAGTTCTATGTCAGTACTCTCAAGACTGGGAAGCAAGGATAGTCTGCTTCCGAAATATTGGAGATATTCATGGAAAGGAAGAGACGGAGGTTGGGATCAAGAAACAGTTATCTCAGGAGCTGCTTCAAGTTGGCAGGAACTATTTGATCTCTTCGCAGATCAGATGTTTGGACAAAGAAAACCCGATCAATATCCCGAATACAATCCTAAATACTCCGCTGAAAAGATTGCTGACATGTTGAACGTAAAAAAAGACGACCGAGAATCATCAGAGGGCTCATTTTATATAAATTTACTAAATCAGAGGAGATACTTTGCTTTCAATGAAAATGATGTTAAAGAATTAGGCATTAAATTTCGCGAGTACTTAACAAATCTCAAAGACGTTGCTAAGCAATACACTAAAGTCGTTAACAGGAACCAAGTGTCAGTCATGTTCCCTATAGCTACAGGAGTACCATTTATTTATAAATATAAGGAACCGGTTCTCCTACATGTTCGTACTGTAACTAAAGGAAACGTTGATTTTAAGGATAGAGAGGAATATAGGTCTAGTGCTTCTATCAATAGCGAGCTGCGGATAATTTACGCTGAAAATCATGATGGCAATGTTGGTTTTCTAGACACTCTTGGTAATCAACTTGCAAGCGTTGGATTAGTGAGAAAAAGTCAACTTAATATTCCAATTAAAATAGATCTTGAAATGAAATCTGGAGAAGCGAAGTTCCATTTAAGTCCAATGGAACCCGAACAAGATAATACTATAGCTCATTACAGTGTTTGGCCATATTCCGCAAACCAAAAGAAGGACACTTTAACACCTATTTCTCAGGATCCTATATCAAGAGTTATTATGAGACCCGAAAAAGTAGCCCAGATTGATAGCAAGTTTGGACAAAACTTTGGATCCATATTCCAACTCCAGGGTTATTCTTATTCTGAAGATTACAGGTACATAGGAGACATGCTGAAGTCCTACAATTATTTAACTAGTATTATCAGGATGTTCAAGCAAAAAGATATAGCTCAAACTCACTTTAATCTGAGGTACTTGGGAAAGCAATCTAAGAACAAAGGAGTCACAATCACAGTAGCTTACGACACACTGTATAATCAGAAAGAAACAGGCGTTATGCCAATAACTGCATCGGATGTGAAGGACTCGACACCCAACAGTCCATCACGACGAGAGGAATTAATTAAACGTGTTATAGCTGGCATACAATCATCTAGAGCCCACGTCGTTGATTTGAGCGCAAAATTCGAGACAGAACAAAAATTGGAGTATACTGCGACCCTTGCAATCGGCGCGAGTGTCGTCGATCAAAAAATTCAGTTTGCTTTATTTGCTGGTAGAAACTCTGATCAATACGGATCAAATCAGTTAAATGCCGTAGGTAGAGTTACGAAACCATTGTCAGATTCCCCTATTAATTTCCAAAAAGCACTAGAAAAAGAACTGAAAATGGATTTTGAGGCCGATATCCTTTACAACCAGAAAGAAAATATCCACATTCTTGGCTCTGCCGAAAGAACAAAGAGATATATAGAAGAACTTCAGAAAGAACCACAAGTAAAGAGATGTCTTGAAAATTATGCCAGAGGTAATTATTACCAACACGACTGTCATGAAGCGGTTGTTATGGCCCATGCTCCAGACAACTTCAAATTCAGTGTAAGTTACAAAGACGTCAGCTCTGGGACTAAAAATGCTGCAGCCTACGCTTACAGAATTTTAGACGGGCTTAATTTATGGAGATCGGATATTAATATGGCAAAGACGTTACCTGCTGGAAAACTTGAATTGAACGTTGATGCTTTATACTGGACAAGAAATTTAAATCTTATTGTAAATTCTCGTTTTGGGGAATTGCGGGTAAACAATATACCTATACCTGAAGTTACTTCTAGAGCTGTGTCTATGTACTTACCGATCAGCGCCTATGAGCGAATTCTAAATTATTACACCTGGCATCAGTATCAACCATATTGCAGTGTGGACAGTAACAGGGTGAGGACCTTCAGTAACCGTGAATATGATTACACGCTGTCACCTTCCTGGCACGTAGTGATGCACGATGACAGACCCGGCAGAAACGAGGATTTAGTCGTGCTGTCCAGAAGACCTCAAGAAATGAAAATGCAAATATACTTATCTTACAGATCTTACACTGGCAAATACATAGAGATGGAAGTTCAACCAGCCCCGGACACTCAACAGAAGCACTCTGTTCAAGTCAAGACCAATGCCAAAAAAGTGTCTGAAGGAGAACTTACAACCTACTGGGACGACGTCAATGACAGTCCGTTACTTGAATACTACAGTACTGGCGACAATGTCTTAATGATCAAATTGCGTGAGAATCGTCTCAGAATCGTGTATGACGGAGAAAGGAGCGTAGTTCTTTCGAGAGACAACCGCAAAAACATCAGGGGAATTTGTGGAAGAATGAGCGGTGATCCTCGCGATGACTACCTAACACCTAGTGGTCTCGTAGATAAACCAGAATACTATGGAGCTTCCTACGCTCTTATTGAAGACGAGAATGATCCCAGAACACAAGAATTGCAATCGGAAGCTAAAAGAAAGGCGTACGAGCCAAGAAAACAATACACCACAATCTTGCAATCTGATAACAAATGGCAAAATGCTATGCTCTCTTCGTCTGAAGATGATTGGGACTCTCAGATCGTATACAGGGCAAGGAACTATGGAAAGAGTAAGGGAAAATGTAAAGTAGTCCCTCAAGTGCAGTATTATGAGAACCAATCACAGATCTGTATAACCACCAGTTCCTTACCGTCCTGCCAGTCTTCCTGTAGCGGAGGCAGCTACAAGATTCAGTCGACACAAGTTGTTTGCCGCTCCAAGCTGGACTCTCAATTCCAATCTTACAGAGATGAAATCAAACTAGGCAAAAGTCCCAAAGTCAGCGGAGAGCCGCGAACTGTAGACTACAGAGTCCCTAGTTCTTGCAAATCCTAA

Protein sequence:

>DPOGS213644-PA
MKLLVLAATIVVVSSGQLSDVVVESPWPWQVGKLYRYDVETHTLARYLDSFSSGNAFRAKFTVRAKSDGYLQARLENPEYAKVYQKLEQHDPMPEDLKYVPVANLDKPFEIYIEGGRILSVKLPFSVTLMQENLIKGLIGSLQVDLTSHRNVKSSHDTYDTQVQQGLFRKMEIDVTGDCETLYTVSPAASEWRRELPSFASDDEPIEITKSKNYGHCHHRVDYHFGVPEGAEWSGTAHKTGKEQFINRATVSRMLVGKNGHIYKAETTSTVTAHPHLYGEQKAQVHGKVRFNLMSYEDDNEPAWVYPEGAREVTNLLYALTAKPIDIGDSSSSEKSIKIEKHPRQRRSSRMKSFVSINKKIVTETHGSSSSSESDSVYVNDDIPNINEPAYASLYMNPDLHGDKKQNPMNAQKLLQEIAQQLQNPNNMPKADTLSKFNILVRVIASMSYGQLGLTSRSIEIAKLANDVVKSNMWMIYRDAVAQAGTLPAFQQIKAWIESKKLEGEEAAEVISVLAVSLRYPTKVVMKQFFDLAMNPEVTKQMFLNDTALIAAAKLINMGQVNNETVHRYYPTHMYGRPSPKEDAFVINEILPRLSQELQLAIENGDSRKSQVYIKAIGELGHPAILDIFKPYLEGKIPASTYLRTRIIEHLYVLAKGRDDYVRAVLFSVLKNTAEPYEVRVAAIDKIFMSRPSTAMMMAMAQMTKDDPSIQIRAALKSAITSASELKNPRFHDLARTAAAVKDMLTSEEFGLQYSGKNFLEHYDRDEQPSSMSVLSRLGSKDSLLPKYWRYSWKGRDGGWDQETVISGAASSWQELFDLFADQMFGQRKPDQYPEYNPKYSAEKIADMLNVKKDDRESSEGSFYINLLNQRRYFAFNENDVKELGIKFREYLTNLKDVAKQYTKVVNRNQVSVMFPIATGVPFIYKYKEPVLLHVRTVTKGNVDFKDREEYRSSASINSELRIIYAENHDGNVGFLDTLGNQLASVGLVRKSQLNIPIKIDLEMKSGEAKFHLSPMEPEQDNTIAHYSVWPYSANQKKDTLTPISQDPISRVIMRPEKVAQIDSKFGQNFGSIFQLQGYSYSEDYRYIGDMLKSYNYLTSIIRMFKQKDIAQTHFNLRYLGKQSKNKGVTITVAYDTLYNQKETGVMPITASDVKDSTPNSPSRREELIKRVIAGIQSSRAHVVDLSAKFETEQKLEYTATLAIGASVVDQKIQFALFAGRNSDQYGSNQLNAVGRVTKPLSDSPINFQKALEKELKMDFEADILYNQKENIHILGSAERTKRYIEELQKEPQVKRCLENYARGNYYQHDCHEAVVMAHAPDNFKFSVSYKDVSSGTKNAAAYAYRILDGLNLWRSDINMAKTLPAGKLELNVDALYWTRNLNLIVNSRFGELRVNNIPIPEVTSRAVSMYLPISAYERILNYYTWHQYQPYCSVDSNRVRTFSNREYDYTLSPSWHVVMHDDRPGRNEDLVVLSRRPQEMKMQIYLSYRSYTGKYIEMEVQPAPDTQQKHSVQVKTNAKKVSEGELTTYWDDVNDSPLLEYYSTGDNVLMIKLRENRLRIVYDGERSVVLSRDNRKNIRGICGRMSGDPRDDYLTPSGLVDKPEYYGASYALIEDENDPRTQELQSEAKRKAYEPRKQYTTILQSDNKWQNAMLSSSEDDWDSQIVYRARNYGKSKGKCKVVPQVQYYENQSQICITTSSLPSCQSSCSGGSYKIQSTQVVCRSKLDSQFQSYRDEIKLGKSPKVSGEPRTVDYRVPSSCKS-