Monarch geneset OGS2.0

DPOGS213188
Transcript        DPOGS213188-TA  2685 bp
Protein           DPOGS213188-PA  894 aa
Genomic position  DPSCF300114 - 4173-9193
RNAseq coverage   654x (Rank: top 20%)
Annotation
Heliconius      HMEL010282        3e-76   57.78%
Bombyx          BGIBMGA007390-TA  2e-116  38.18%
Drosophila      Mtp-PA            2e-37   24.70%
EBI UniRef50    UniRef50_E0VGP3   2e-55   24.97%  Microsomal triglyceride transfer protein large subunit, putative n=1 Tax=Pediculus humanus corporis RepID=E0VGP3_PEDHC
NCBI RefSeq     XP_001654415.1    9e-58   26.23%  hypothetical protein AaeL_AAEL010280 [Aedes aegypti]
NCBI nr blastp  gi|157125660      2e-56   26.23%  hypothetical protein AaeL_AAEL010280 [Aedes aegypti]
NCBI nr blastx  gi|383857831      4e-58   26.16%  PREDICTED: LOW QUALITY PROTEIN: microsomal triglyceride transfer protein large subunit-like [Megachile rotundata]
Group
Gene Ontology    GO:0005319  5.3e-14  lipid transporter activity
                 GO:0006869  5.3e-14  lipid transport
KEGG pathway
InterPro domain  [38-250]   IPR015816  5.3e-14  Vitellinogen, beta-sheet N-terminal
                 [39-260]   IPR015819  4e-10    Lipid transport protein, beta-sheet shell
                 [113-176]  IPR001747  5e-07    Lipid transport protein, N-terminal
Orthology group  MCL12672 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213188-TA
ATGTTGAGGTATATAATAATATTATATTGTTCTGCGATTTTACAAACGACTACGTCTTCTTCCGAAAGCCGGACGGATTCCCGTCAGTTGAAATTGTTTGAAGAGCCGATATCTTACGATGTGGAGTCAACGGTGTTGCTGAACGAGCTGGAGCGGAAAGAAAAGGAGGTTAGCTACAAAGTCAAGGCAAAGCTTAATGTAATCCCACTCTGGACCGAGGCCGGCGTCCAAAGTGTTCTCAAATTCGAGCTCCTGTCACCACATCTATTTTCCCGAAGTAAGAGTGTGACAGCAGATTACTTACCGATGAATTCCATCTGGAACCTGTATTCACATTCCACATTTTATGCTCATTGGAAGCTCGGTCTCATCGAGACTGCTTATTTGGATCCCAAAGATCCAGTGGCGATACAGAATTTTAAAAGGTCCCTGATGAGCGTGTTCCAGTTTGAAGCGGTCGGCAAGGAGCTCAATGAAACAGACGTCTCCGGTACTTGTCATGTTCAATATGAGACCACTTCAGCCAACACAGTCAGGAAGTACAAGAGTTCATGTAGACTGGACGACTGGCCTGAGATAGAAGAGGGCGTGGTGAAGTCGCGGAGACTGGCCAGATACACTTTGGATGACCGGCGACACTCACTCCGGGACCTGCACGCTGAGGAGCTGCACGAGTTGGTGGCGGGCGGCAGGGGAGGGAGGGGAGGACTCAAGGCGCGGGCCTGGCTGAGGCTCACCGCGGATGAGGCTGCGGGGGCCATGGTCCCACAGGCTTCCTCTTTGGCCGAGGCTCTGGCCGGGCTTCCCGACCACGTGGCGGCAGAGTCGCTCCAGGCCGCACCCGTCCAGGTGACGGACCCGCTCTACTTGGAGGAGAGTTTGGAGATCGAATTGGAAGCGGCCATTTTGGAACTGCGCGACCACTCGGCGGAGAGGCACCCGGGGCCCGAGGGGGGTGAGGGGGGAGAAGCGATGACAGCCCACGTCATCAGGAGGCTTGTACGAGCGCTTAGAGCGGCTCCACTCCCCGCACTGGTTCCGATCCTCTCCGACGACCACGATCCGGAATACCTGAGGCAAATGTGTATGGCGCTGGGGCTGGCGGGGACCGCGCATACACACGCCACGGCGATGCGCTTCTTCCGGCTGAGATCGCGCGACGCCCCCGTCAAGTTGGTGCACACATACCTCGCCGCGTTGGCGCTCACACCGCAACCACACGAATCCGTCGTAGAGGACGTGCTGCGTCTGGCCGAGGAGATGACCGACCTCGAGCTAGCGGAGAGCGCGCTGCTGGCTGGGGCCGCCGCCGCCGCCGGGGACGAGAGGCTCTCGGCCGGCGCCAGGCACGCGCTGACCAAGAGCCTCGTGCGGTGTAAGACCGACGAGTGTCGCCGGGTGAGGATCGCGGCCCTCGGCAACTTGAAACGGGAGGACACCGTGGAGCTGCTGCTGGAGCACGCGGAGGCGGGGCGGCGGGGGGTGGCGCTCGCGGCGCTGGACGCGCTGGGCGGGACGCTGGCAGCGCTGCGAACAGACACGCGCCTCCCGCGCCTGGAGCGGCTGGTGCTGCGGCCGAGCCCGCTGGAGCTCCGCGCCGCCGCGCTCGACCTGCTGCTCCGCTGCAGCGCGGAGGCTCCCTTCGGCCTCACTCGCTTGATGCTGGAGTTGCACGGAGGAGCGCCGCTCGAACTCGTGCGCCTCGCCTGGCACCGCCTGCACGCGCTCGCCGACGACCACCCGCACGTCCGGAACATGATGAAGCTCCTCCCTCGACGACTCCGAGGCTGGGAGGTGCAGGCCATGGCGGGCACTTCGTCGGTACTGGTGCGGGAGCTGGGTGCGGTGGACGGCTGGCGCGCCCGCCTGGAGTCCGTGCAGGTGGCCAGCGGCGGACTGCTCCGTCGAGGACGAGTTCATTTGTCAGCCGTGGCACCACACGGAGACTCGGATGACACCCTGGCTGTCGAATTTTGGACCAGAGGACTCGAAGCCATCGC
GGGAAACGGTCAGAACTCCGACGAGGAAGACGAGGAGGGTGTGGAGGTTAGCGGCGGCCTAGCGCTCTCCGTCGGGGGTGTCCGCGGCAGGAGCGTGACGCTGTTCACGGGTCAGGGCGAGCTGCTGGGGCACGTGTGGGCCGGCACCGCCAGCGAGCCCACGCCCGTGCTCCGCGCCGCCAGGACGGTGGGGGCGTCCGGCGTCGTTCTGCCGCTGCTGGACGGAGCCGCTCTGCGCCTGGAGCGGGACTCGCTGCAGCTGCTGGCGTTGGACGCGGCCGCGCAGGTGTCGCTGTGGTCGAGGAGCGCACGCTCGCAGATGGGCTTGAGTGTCGCGCTCGAGGCGGCTTGGGGAGCTCGCGTGGAGGCGGCCGGCGCTCGATTGACGGCGCAGAGCTGGCTGCGCGGCGCACCGAGGATGGCGGTCGAAGCGGACCTCGATTTCTACGACGGAAACGTGCTTTGCGTCCGAGTGTACACCATCGGCTGTGACAGCAGCTACGGGTCCGAGCTGGTGTCCAGGACGGGCATGAAGCACGTGTGGCGACACACACATACAAACACACACACACACACACACACACACAAATACAAATACACACACACACAAATACAAATACACACACACACACACACATACACAATACAAGTACAAATACACACACATGATCATCTAGGTGTGTAA

Protein sequence:

>DPOGS213188-PA
MLRYIIILYCSAILQTTTSSSESRTDSRQLKLFEEPISYDVESTVLLNELERKEKEVSYKVKAKLNVIPLWTEAGVQSVLKFELLSPHLFSRSKSVTADYLPMNSIWNLYSHSTFYAHWKLGLIETAYLDPKDPVAIQNFKRSLMSVFQFEAVGKELNETDVSGTCHVQYETTSANTVRKYKSSCRLDDWPEIEEGVVKSRRLARYTLDDRRHSLRDLHAEELHELVAGGRGGRGGLKARAWLRLTADEAAGAMVPQASSLAEALAGLPDHVAAESLQAAPVQVTDPLYLEESLEIELEAAILELRDHSAERHPGPEGGEGGEAMTAHVIRRLVRALRAAPLPALVPILSDDHDPEYLRQMCMALGLAGTAHTHATAMRFFRLRSRDAPVKLVHTYLAALALTPQPHESVVEDVLRLAEEMTDLELAESALLAGAAAAAGDERLSAGARHALTKSLVRCKTDECRRVRIAALGNLKREDTVELLLEHAEAGRRGVALAALDALGGTLAALRTDTRLPRLERLVLRPSPLELRAAALDLLLRCSAEAPFGLTRLMLELHGGAPLELVRLAWHRLHALADDHPHVRNMMKLLPRRLRGWEVQAMAGTSSVLVRELGAVDGWRARLESVQVASGGLLRRGRVHLSAVAPHGDSDDTLAVEFWTRGLEAIAGNGQNSDEEDEEGVEVSGGLALSVGGVRGRSVTLFTGQGELLGHVWAGTASEPTPVLRAARTVGASGVVLPLLDGAALRLERDSLQLLALDAAAQVSLWSRSARSQMGLSVALEAAWGARVEAAGARLTAQSWLRGAPRMAVEADLDFYDGNVLCVRVYTIGCDSSYGSELVSRTGMKHVWRHTHTNTHTHTHTHKYKYTHTQIQIHTHTHTYTIQVQIHTHDHLGV-
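
The transcript and protein records can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly, and it assumes the trailing "-" in the protein sequence marks the stop codon. The names CODON_TABLE, translate, and expected_protein_length are hypothetical helpers introduced for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: the record does not say which table
# was used to derive the protein sequence.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Whitespace and line breaks inside the sequence are ignored, and stop
    codons are written as stop_symbol to mirror the record's convention.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a transcript that is pure CDS ending in one stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First 30 and last 15 nucleotides of DPOGS213188-TA, copied from the record.
first_codons = "ATGTTGAGGTATATAATAATATTATATTGT"
last_codons = "CATCTAGGTGTGTAA"

print(translate(first_codons))        # start of DPOGS213188-PA
print(translate(last_codons))         # end of DPOGS213188-PA, including the stop
print(expected_protein_length(2685))  # compare with the listed 894 aa
```

Under these assumptions the two excerpts translate to the first ten and last five characters of DPOGS213188-PA, and 2685 bp corresponds to 895 codons, that is, 894 residues plus one stop codon, which agrees with the lengths listed in the record.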