Monarch geneset OGS2.0

DPOGS210299
TranscriptDPOGS210299-TA1491 bp
ProteinDPOGS210299-PA496 aa
Genomic positionDPSCF300305 - 105919-111691
RNAseq coverage320x (Rank: top 36%)
Annotation
HeliconiusHMEL0028840.084.15% 
BombyxBGIBMGA013893-TA0.071.21% 
DrosophilaCG15828-PC9e-8032.85% 
EBI UniRef50UniRef50_D6W7B82e-9339.91%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W7B8_TRICA
NCBI RefSeqXP_971721.21e-10140.73%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892338213e-10040.73%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1892338212e-9540.17%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00053191.1e-55lipid transporter activity
GO:00068691.1e-55lipid transport
KEGG pathway 
InterPro domain[51-289] IPR0158161.1e-55Vitellinogen, beta-sheet N-terminal
[48-312] IPR0158192.8e-46Lipid transport protein, beta-sheet shell
[53-477] IPR0017475.8e-43Lipid transport protein, N-terminal
[352-479] IPR0110303.9e-17Vitellinogen, superhelical
Orthology groupMCL10529 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210299-TA
ATGCTTCCTAATAATAGTTTTTTAAATAGAAAGCTAGAGGAATTACATAAAAAGGAATTACAAACTTATCTTCTACTTTCAGATTCTCTAAAGGACCCATACATTTGCGGTCCACCGACCTGTATTTCGTCAGAGAAATTTAAATACCTTACGGATGTTAAATATGAATATGAATACAAAGTAAAAGTGGAAACTTACTTCGCCGGTTCCAGTAATAACCGTTCTACTTTAGATGTATCTGCCCAAGCAACGGTTCAATTCATCAAGCCATGTGAAGGTTTACTACAACTAAGTAATGTCAAACTAAAGGATCAAGATGAAAATTATCCAGTTGAGAGAGCTGAGAAATTTGTACAGGCTGTATCCACACATGACCTCAGATTTGCTTTCCATGATGGTATCATATCTGAGATATGTCCAGATGAGATCGAAGAAGATTGGGTACTTAACTATAAGAGGGCAATACTTTCACTCTTTCAAAATAGCATGAAGAGATTTGACATTGACTTCAAAGGAATCGAGCAAGACATTCATGGCACATGTGATGTTTCTTACACCGTCAGAGGCCAAGAGAACACTAGTCTGATACTAGTTAAGACAAGAGATCTATCATTATGTACGGACAGATACAAGTATATGTCCATACTGCAAACTGTGAGATATGACTTCCAGAGTAAATTCCAAACATGGCCAGTACTAAAATCTGAAAGCAAATGCCGCATAGCAGTTGATCATCATATATATAAGTCAGTTAATTGTAGAGAGAGACATTTATTTGAGCCGTTCTCTGGTAAAAATTCAGGAGCTATGACAACTGTTATACAGGACTTGGTACTTATAGATGAATACAATAGAACTGGTTCAGAAACTATGTACACTGAAAATAAAGCTTGGGCGATGATAACAAAACGATCAAATATCCTCCACCACCACGCCCCAAACATCAGATCCGACACAGGTGAATTGAAGTCAGCTAGAGATGTATTGAAATTACTTTGTATGGTTAAGCCAAATTCTGATGATGAAGTAAGAACAGTTGATGAGAACATGGATAGTGGTTCGACCGTGGGTCTTTGGGGCCGGTTAGTGAGATCAGCTCGCAAGTTACATCACCCAGCGCTAGCACAACTATTGGCGAGGGCCCCCACTATATGTGACAGCGCTTCGAAACACATCCTAGACGCCCTTCCATATATAGCGAGCGCTGGTTCAGTCGAACTTATAAAGAACATGATAATAAAGAAGGAAGTGGATTCAGATACACGACACGAATGGCTCATGTCAATGGCCATGATACCACGACCAAAAATAGAGATGTTAAAAAGTATGTTGGAACTTCTCCAGACACAGAGGAATGATCAGGTTATAAGTTTCACTGTATCGTCCATGGTGCACTCATATTGCAAACACAGTAGAAAAGCATGTAAAAAAGATACTTTTAACAGAAAAAATGTTAATGAAGCGACTATCCTGCGTGATATAGAAACATAA

Protein sequence:

>DPOGS210299-PA
MLPNNSFLNRKLEELHKKELQTYLLLSDSLKDPYICGPPTCISSEKFKYLTDVKYEYEYKVKVETYFAGSSNNRSTLDVSAQATVQFIKPCEGLLQLSNVKLKDQDENYPVERAEKFVQAVSTHDLRFAFHDGIISEICPDEIEEDWVLNYKRAILSLFQNSMKRFDIDFKGIEQDIHGTCDVSYTVRGQENTSLILVKTRDLSLCTDRYKYMSILQTVRYDFQSKFQTWPVLKSESKCRIAVDHHIYKSVNCRERHLFEPFSGKNSGAMTTVIQDLVLIDEYNRTGSETMYTENKAWAMITKRSNILHHHAPNIRSDTGELKSARDVLKLLCMVKPNSDDEVRTVDENMDSGSTVGLWGRLVRSARKLHHPALAQLLARAPTICDSASKHILDALPYIASAGSVELIKNMIIKKEVDSDTRHEWLMSMAMIPRPKIEMLKSMLELLQTQRNDQVISFTVSSMVHSYCKHSRKACKKDTFNRKNVNEATILRDIET-