Monarch geneset OGS2.0

DPOGS206690
TranscriptDPOGS206690-TA1416 bp
ProteinDPOGS206690-PA471 aa
Genomic positionDPSCF300048 + 1283285-1286589
RNAseq coverage215x (Rank: top 45%)
Annotation
HeliconiusHMEL0088382e-11674.64% 
BombyxBGIBMGA008523-TA7e-16875.65% 
DrosophilaCG14232-PA2e-10647.07% 
EBI UniRef50UniRef50_Q7QBG25e-11951.32%AGAP003185-PA n=5 Tax=Endopterygota RepID=Q7QBG2_ANOGA
NCBI RefSeqXP_001651890.15e-11949.05%hypothetical protein AaeL_AAEL006309 [Aedes aegypti]
NCBI nr blastpgi|3479694252e-11851.32%AGAP003185-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3320303362e-12552.09%Protein TMED8 [Acromyrmex echinatior]
Group
Gene OntologyGO:00068109e-29transport
GO:00160219e-29integral to membrane
GO:00000624.9e-17fatty-acyl-CoA binding
GO:00054887.6e-16binding
KEGG pathway 
InterPro domain[320-471] IPR0090389e-29GOLD
[35-127] IPR0005824.9e-17Acyl-CoA-binding protein, ACBP
[36-128] IPR0143527.6e-16FERM/acyl-CoA-binding protein, 3-helical bundle
Orthology groupMCL16097 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206690-TA
ATGGCTAGTAAGGTGGAAGGTAATCAAAAAGTAGATCCAGTAAGAGTTCCGTTAGAAACTGATCAAGGTATACTTGAAAGCGAAGAACGTAAATGGGGACTACCATTAAAAGAAGCCTATAAACATGGACTATCTTTCTATAAAGAAAAAGAAGGCAAAGCTATACACTTAAGCTATGAAGATAGATTGAAGCTTGTTGCATACACACAGCAAACAGCCCATGGTCCGTTAGATGTAAACTCAGCACCACCATTGGGAGTTTTGGATGTCATTGGAAGAGATAGGAGAGCAGCTTGGCAAATGCTTGGACAAATGTCTCAAATACAGGCCATGGCTGGTTTTGTACACACACTTGATAGATTATGTCCCCTCTTCAAACCATATTTAGAGGCAATTCAGAAAGATCTAGAAGAAAAGAAGCAACAAGAGCTGAAGAAGTCCGAAGAAGAAAAGGCACATAGTGAACTACAAAATAGGATAATTATGGAGAAGGAAAAGCAGCAAAGCACAAAATTAACTGAAGAACAACAAGTTCAAAGAATAAAGGATGCTTTGAATGCACAAACATATGACCAGTTCCTTCAATATGCGCAGCAACAGTACCCTGGCAATTTTGATCAACAAGCTATTCTTATAAGGCAGTTGCAAGATCAGCATTACCAACAATATATACAACAGCTGGCAGTGGACCAGCGACTTGCTAACTCCAGTATTGATCAGAAAGACGACTCTGATTACGAAAATGATATATCGGGAAAAGAAATAGCAAAGGATTGCAATCTGAATGAAATTGACAATGGGATTACAGATAATAAGGAAATTCAAACTATAGACAAAATTAACAGTGAATATGTTGAAGAGACAAGGGAAGAAGAATCAGATGGAGACGATGGCCTTCCATATGTGGAAGAGGCTAGAATGTGGACTCGGGGTGATATTATTCAGTTTAAGGAATCAGCTCGCGAAGGTGGTGGACGTTTGACAGTCGGTCATGGTGAGACCGTCACTGTGAGAGTCCCAACACATCCACGAGCTGGCTGCCTTTGTTGGGAGTTTGCTACCGATGGATATGATATAGGTTTTGGATTGTACTTCGAGTGGACCAAGACTCAGACCACCGAAGTAACGATTCATGTTTCTGAGACTGATGAAGAAGATGATGAAGATGATGGTGACGAAGAATTCACGGCCCAAGAATCTAATGATCCTGAGATTGGTTCAGAAATACGCTCTCTAAACAAACAGAAGCCTTTGTTGAGCCTCGTTGTACCCATATATAGGCGAGATTGTCATACAGAGGTGTACGCTGGTTCCCACACTTACCCCGGCGAAGGAGTATATTTACTGAAGTTTGATAACACTTACAGTCTTTGGCGATCCAAAACCCTGTATTACAAAGTCTATTATACACAATAA

Protein sequence:

>DPOGS206690-PA
MASKVEGNQKVDPVRVPLETDQGILESEERKWGLPLKEAYKHGLSFYKEKEGKAIHLSYEDRLKLVAYTQQTAHGPLDVNSAPPLGVLDVIGRDRRAAWQMLGQMSQIQAMAGFVHTLDRLCPLFKPYLEAIQKDLEEKKQQELKKSEEEKAHSELQNRIIMEKEKQQSTKLTEEQQVQRIKDALNAQTYDQFLQYAQQQYPGNFDQQAILIRQLQDQHYQQYIQQLAVDQRLANSSIDQKDDSDYENDISGKEIAKDCNLNEIDNGITDNKEIQTIDKINSEYVEETREEESDGDDGLPYVEEARMWTRGDIIQFKESAREGGGRLTVGHGETVTVRVPTHPRAGCLCWEFATDGYDIGFGLYFEWTKTQTTEVTIHVSETDEEDDEDDGDEEFTAQESNDPEIGSEIRSLNKQKPLLSLVVPIYRRDCHTEVYAGSHTYPGEGVYLLKFDNTYSLWRSKTLYYKVYYTQ-