Monarch geneset OGS2.0

DPOGS215110
Transcript: DPOGS215110-TA, 3657 bp
Protein: DPOGS215110-PA, 1218 aa
Genomic position: DPSCF300139 + 101856-117628
RNAseq coverage: 194x (Rank: top 48%)
Annotation
Heliconius: HMEL004581, 1e-118, 53.22%
Bombyx: BGIBMGA009599-TA, 0.0, 39.56%
Drosophila: CG31150-PA, 6e-38, 26.12%
EBI UniRef50: UniRef50_Q177S4, 1e-38, 25.63%, Putative uncharacterized protein n=4 Tax=Culicidae RepID=Q177S4_AEDAE
NCBI RefSeq: XP_001657388.1, 2e-39, 25.63%, hypothetical protein AaeL_AAEL006028 [Aedes aegypti]
NCBI nr blastp: gi|157112042, 5e-38, 25.63%, hypothetical protein AaeL_AAEL006028 [Aedes aegypti]
NCBI nr blastx: gi|170031387, 1e-38, 25.27%, conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene Ontology: GO:0005319, 6.2e-09, lipid transporter activity
GO:0006869, 6.2e-09, lipid transport
KEGG pathway 
InterPro domain: [24-195] IPR015816, 6.2e-09, Vitellinogen, beta-sheet N-terminal
[23-162] IPR015819, 9.3e-07, Lipid transport protein, beta-sheet shell
[569-768] IPR015255, 1.1e-06, Vitellinogen, open beta-sheet
[285-495] IPR011030, 8.8e-06, Vitellinogen, superhelical
Orthology group: MCL25626, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215110-TA
ATGATATATCGATGTATACTTCTGTTTGCTGTTCTCGCCCCGGCCTCGGCGACGGGGATACAAATGCTTTTCCCTGATCAGAAGCAGTACAATTACGTTGTCAAAACAAATATATCAACGGGCGTCGCCCATAGGAATTCTTACTGGACTTTGGAGGGCAGACTAGTTGTACTTGTGGACGATAATTACTATGCCACAGTTCAATTTAAACTGGAGGATTTGAAGACATCTGTGTATAGTTATAACACCGGTTTTTCATCGTATCACACCCCCGAAGCGGCCAGGGAATTGGAGGAGCCGTGGAAAATTATTTATCAGGAGAACGGTTTCATAAGTGCAATCCAGCATTTGCCGCACGAGAGAGTTTGGGTGACGAACATAAAGCGAGCGATATCCGTCAATTTCCAGTTGAAAAAGGACGCCGGTTCTTATACCAATGAAGAGCCGTGCCTGTACGAGAGTTGTGTTATGGTGTATTCCGTGCAAGGGAACACAATCAAGAAGTATAACTCCTATCAGAAAGCGTCCATGACCTCACAGACCAGTTGGAGTTCGGTGCCGTGGAGTGGAGATTACGGGAGGGGGGTGCCCGAGAACATAGCGACCTCGCAACGAGTGTACGATCTCGATGAAAAGGGCCTCAATTGTTTGAACATGAAGGGTTTGTTCGAGTACATAGTGGACGGGCATGTGCTCACAGTTACAACTGAGTCGGTGGAGGAGGAGCTCCAGAAGTCAAATCAAATGGGTTACGAGGAACTACAGCATGAGCTATCGTGTGAGATAAGTAAATATATGAACTGTGATGTGTCTTATACCCTCAAGAGGTTACTGCCGGGGGACTTGAAGGATAATATAGAAGATGCTACAATTCTCTCACTGATCCGTAAGCTGCCGTTTAACATAGCGAATCACAGCCAGGCACTTCTTGAGGACATGGAAACTATCAGCAAGTTGGGTTTGGACTTCCCTCAAGAGATACGACATGCTGGAATTCTGTCATTTGCAATAATGGTGTCCCATTGCGTGGAGGCGATGAAGGTCAAACAGGACTATTTTGACAGCATTTTAGTCAAATACTTTAGAATGTACAGTGACTGCCCTCAATATCTGGATCGTCTTATTTGGCTCCAAGGTCTTTGTAGTTTGGGTTACACATCAGAGTCTTACATACGAACGATATACGCTGACAAAACCAGGAATAGACATGAAAGATTGTGGGCATCACTGGCTTGCGGACAGGATACCAGAGGATACAATGTCTTAGAAACTTCTTTGCCAATCTTAATGGACGACAACGAGCACATCCAACTACGTATATCAGCTCTCCACGCTATCCTCAGCTCGGGGATAAGAGAGAGCGACTTTCTTTTCCTCCACAGCTGGATATCCACCAGCCGACCTGAACTCCAGAGGTTCTGGTACTCAACTGTTAAGAGCCTTGAAAGCAACAAATTCCCTAAGTATAGAACTATTTCAGATTACATTCCTTTCGTCTCGAAGGACGTTGTTAACCCTGACTCATCACTGTGGGGCACTAACAACTACATCGTCAGTGGTGATGAGCTTAGTGGCTGGGTACAGGTGATGACTGTTGGAAACCCTGCCCCAACCCTTGCAGCGCTCTCCGTTTCTACTGGAGGACGACGGGCCTATCAGGCTTCGGTGTACATTATAGCTGAAGGAGTGCAGTTTGATAAAGTGAGGAAGTGGAAACAGAGTGATCTCAAAGTGGATAATCTATTAAAAATCCTAGAAAGATTGAATGTGCGAAATTTGAAGACATCAGAAGAGGTTCACATAGATGTTGTCATTAAAATCCAAGACAAAACAGTCTACGCAACTCACATCAACCAGACGAGGTTCGAGTCTTGGAACGGATATGACTTGACGAAATCGATCACGGAGTTCCTGAGGTTCGGCAGTCACATAAACCAGCAAATAGCGTACTATCCCATTCAGATAGACGTCAATGTACCCAGCGATTTAGGCACACCCAT
CCGCCTTCAGTCGTCCGTGGTGACCTTCACTTCACTCCGCGGTAACCTCACCTCTAACCCGGACAGTGGACTCGCCCTGGATTGGGTTAACGATTTACATATACGCCATCAAGCGACTATAACAACAGCGCTATCAACAATCGCGCCGTTGCTGCAATCAGAACACGAAGTACGCGTCCAGCGTTCCGCGGTAGCGCATCTACCTATCAAATTTAACGTAACCATGGAACAATATGCTAAGTCTATAGCTTTAACCTGGTTGAACCCATTCGCTCAACGTGCCGGCTTCGCCATCCACTCTTGGATCCAAGTACACATGGACGCCAGGCAGCCTGATCTGTACACAGTCTCCAGTGGAATAACTACTGATGATGACAGTGGCATCTTCTTTGACTGTGAGAGAAAGACTTCAGGTGCTGAGGTGGTTGAGAAATATATAATGTCCAAGTTCATGAGCTACGATCTATTTCCAACCAAACATATTCTCAATACAATATCTCGGTTCATAACCTCTTGTGGGGTGATCATCCCTCCCAACCGACTAGTGGGAGGGGAAGACGAGATGGTTCACGTGGAATTTACGTTAGGAGACATTGTGTTCCAGAAAGTGGATAAGATAGAGATGGAATTTGACTTCATGTTGAAGTATTATAGTATAAATGATCCAAATAAACAGATATACTTAAAAATTGATTCAAATACTAAAATAAAAAGTGCGGGGAGGAATCTTTTTATCAAATGGTTCTTATACGTCAACCAGCCGTACTCCTTGGATCCGAAGAAGAAGTTTTGGAAGCTTTGTTATACCCAAAAAGACACGAGCCACGCCCCATCAGACCAAGATATCACCATCCATCCATCTTCCTACCATGGGACATCTAAAATATTGTATCAGACATCAGAAGAATACAACCAGTGTAATCCCAATGAGACAAGACCTAATGAATCTATTGAGACAAAACTTCAGTTGAACTACAAAGGAACGCCGAAAAACAACAGAGGCACCATCGAGAGATATGTCGAAGTCGACATTTTGGGGGAGAATTTGCATAATTTTGATCTACTGCATAATCTGGGCTACGGTGTCAAGACGCCCGTCGCTCAACTATTGGGAAGCTTCGATAAGAACAAAATTAACACTACGTCAGTTATTAAGGAAAAGGATGGTATAGCATCAATACGTGTGAATGGAGGCGTGGAGGTGGAGATGTACGTGGGAGGTCTGTCCTGGCTGCTCGATAGCTGGACCGCCATGCAACTGATGAGGAGGTTCGGGTTATACCGTGAATGTCGTCTTCAAGAGTCGACGGTCCAGATGTTGTCCGGATCCGTGGAACAGCTGCAACCCCTGCAGTGTTCGGAAAGCCTCGTCCTTGCGGATTGCTCTGTATCTCCAAGGTTCGTTATTCTCCGGAAACAAGATGGCGGTATCCAACTCTACGACGGAGATTATCCGTCCAAAAATGCGACGAGCGTACATTCTTCTAAAGTGACAGATTATAAGTCCATCTTGGAAGTTGGAACCAAAATCCTTTCGGAATCCACGGGTGTAGTGATGTATAAGAGATTAAATGAAAACGTTATTCTTTTACCATCTTCCTACATGCATTCTGTGTGTGGAGAATGCGCTGGTCATACAACGTATAACAATTGTTGA

Protein sequence:

>DPOGS215110-PA
MIYRCILLFAVLAPASATGIQMLFPDQKQYNYVVKTNISTGVAHRNSYWTLEGRLVVLVDDNYYATVQFKLEDLKTSVYSYNTGFSSYHTPEAARELEEPWKIIYQENGFISAIQHLPHERVWVTNIKRAISVNFQLKKDAGSYTNEEPCLYESCVMVYSVQGNTIKKYNSYQKASMTSQTSWSSVPWSGDYGRGVPENIATSQRVYDLDEKGLNCLNMKGLFEYIVDGHVLTVTTESVEEELQKSNQMGYEELQHELSCEISKYMNCDVSYTLKRLLPGDLKDNIEDATILSLIRKLPFNIANHSQALLEDMETISKLGLDFPQEIRHAGILSFAIMVSHCVEAMKVKQDYFDSILVKYFRMYSDCPQYLDRLIWLQGLCSLGYTSESYIRTIYADKTRNRHERLWASLACGQDTRGYNVLETSLPILMDDNEHIQLRISALHAILSSGIRESDFLFLHSWISTSRPELQRFWYSTVKSLESNKFPKYRTISDYIPFVSKDVVNPDSSLWGTNNYIVSGDELSGWVQVMTVGNPAPTLAALSVSTGGRRAYQASVYIIAEGVQFDKVRKWKQSDLKVDNLLKILERLNVRNLKTSEEVHIDVVIKIQDKTVYATHINQTRFESWNGYDLTKSITEFLRFGSHINQQIAYYPIQIDVNVPSDLGTPIRLQSSVVTFTSLRGNLTSNPDSGLALDWVNDLHIRHQATITTALSTIAPLLQSEHEVRVQRSAVAHLPIKFNVTMEQYAKSIALTWLNPFAQRAGFAIHSWIQVHMDARQPDLYTVSSGITTDDDSGIFFDCERKTSGAEVVEKYIMSKFMSYDLFPTKHILNTISRFITSCGVIIPPNRLVGGEDEMVHVEFTLGDIVFQKVDKIEMEFDFMLKYYSINDPNKQIYLKIDSNTKIKSAGRNLFIKWFLYVNQPYSLDPKKKFWKLCYTQKDTSHAPSDQDITIHPSSYHGTSKILYQTSEEYNQCNPNETRPNESIETKLQLNYKGTPKNNRGTIERYVEVDILGENLHNFDLLHNLGYGVKTPVAQLLGSFDKNKINTTSVIKEKDGIASIRVNGGVEVEMYVGGLSWLLDSWTAMQLMRRFGLYRECRLQESTVQMLSGSVEQLQPLQCSESLVLADCSVSPRFVILRKQDGGIQLYDGDYPSKNATSVHSSKVTDYKSILEVGTKILSESTGVVMYKRLNENVILLPSSYMHSVCGECAGHTTYNNC-
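
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the standard nuclear genetic code and follows the record's convention of writing the stop codon as "-". The names `CODON_TABLE` and `translate` are illustrative only. To keep the example short, it translates only the first 15 and last 12 nucleotides of DPOGS215110-TA. Passing the full sequence, with the line break removed, should work the same way.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons and last four codons of DPOGS215110-TA.
print(translate("ATGATATATCGATGT"))  # MIYRC
print(translate("AACAATTGTTGA"))     # NNC-

# Length check: 1218 residues plus one stop codon account for 3657 bp.
print(3 * (1218 + 1) == 3657)        # True
```

The two translated fragments match the start (MIYRC) and end (NNC-) of DPOGS215110-PA, and the length check is consistent with the 3657 bp and 1218 aa values listed in the record.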