Monarch geneset OGS2.0

DPOGS202824
TranscriptDPOGS202824-TA1320 bp
ProteinDPOGS202824-PA439 aa
Genomic positionDPSCF300018 + 502923-523365
RNAseq coverage58x (Rank: top 69%)
Annotation
HeliconiusHMEL0026622e-12288.63% 
BombyxBGIBMGA010441-TA5e-7877.83% 
DrosophilaCG6565-PA6e-0726.67% 
EBI UniRef50UniRef50_E9IA519e-7145.51%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9IA51_SOLIN
NCBI RefSeqXP_395822.32e-4858.33%PREDICTED: similar to T28D6.7, partial [Apis mellifera]
NCBI nr blastpgi|3228019943e-7045.51%hypothetical protein SINV_04543 [Solenopsis invicta]
NCBI nr blastxgi|3228019941e-6943.79%hypothetical protein SINV_04543 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain[11-149] IPR0233932.4e-41START-like domain
[186-279] IPR0029131.5e-15Lipid-binding START
Orthology groupMCL15943 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202824-TA
ATGGGCGGCGCGGGTTGGACGCGTGAGGCGCGAGTGGCTGAGGACTCCGACTTCCAGACCCTGAAGAACCTGCTGTCCTCCGAGGACGGATGGACCCTGGAGTACGAGAAGGATGGAGTCAAGGTCTGGGCGGAGGACGCCGCTCACGGCGCGCTGCGAACCGTCAAGGTGGTCGCTGAATTCGAAGATGTGGATCCTGAAGCGTTGTACGACGTGCTTCACGACCCGGAGTACCGATCAGTCTGGGATACACACATGCTGGCGGCCGAAGACGCAGGTCACATCAATGTCAACAACGATGTCGGATACTATGCCATGTCGTGTCCCGCACCGCTCAAGAACCGGGACTTCGTCCTGCAGAGGTCCTGGCTGGACACGGGAGATGAGAAGATGATCCTAAACCACTCCGTCTATCACAAGGATTACCCGCCAAGGAAAGGTTTCGTCAGCTGTGTAGCTTCCTCCGACATCTTATTACAGAATGGCATCCTCAATACGTTTGTATTACTTTATATAAAGGAAATTCTGTGTTCAGAGGAAATGTCAGTGTTAATGTCGTGTCCCGCACCGCTCAAGAACCGGGACTTCGTCCTGCAGAGGTCCTGGCTGGACACGGGAGATGAGAAGATGATCCTAAACCACTCCGTCTATCACAAGGATTACCCGCCAAGGAAAGGTTTCGTCAGAGCCCTTTCGTTGTTGACGGGGTTCGTAGTGCGACGGCGGAATGGTCCTGGCAGCTGGCTGGGATACGTCTCTAGATCGGACCCACGAGGAGCCCTCCCTGCGTGGCTTGTTAACAGAGTAACGGCTCAGTTGGCACCTCGTCTCGTGCATCAGCTCCACGCAGCCTCCCGCCGGTACCCTGGGTGGAAGGCGCTGACCGACACCCCCTATTACCAACCCTGGAGGAATCCAGAACAAGTGCCACCTTACCGCATCAATTTAGAGGATTGCATTGACCCCGAAGCTCCCCCACCTCCGGTGGAACAGAAGCCCAATACTTCGAAAACAAAACTTGAAGTACCGCAAGTTAAAGAAGTAGAAAGTCATAGTATCGAAGATTTGGGAGACATATCATCAGAATTCAGTGTGGAAGTAGAAGAACTAGCAGATCTCACTCTCGATCCGACTAAGAAGAAAGGGAAGTTCTATAAATTGGTGAAATCCATGAAGAATAGAAGGAAGAGTGTTCAAGTTGCAAAGAGTGCGGACAATTTAGTAGATGCCAAACCTACAGAAGTCGAAAAAACAGAAAAGAGGAAGTCGTCTTTCAAATTTCACCGAAGGTTTAGTCTTAGTAGGGGACGGGAGGACTGA

Protein sequence:

>DPOGS202824-PA
MGGAGWTREARVAEDSDFQTLKNLLSSEDGWTLEYEKDGVKVWAEDAAHGALRTVKVVAEFEDVDPEALYDVLHDPEYRSVWDTHMLAAEDAGHINVNNDVGYYAMSCPAPLKNRDFVLQRSWLDTGDEKMILNHSVYHKDYPPRKGFVSCVASSDILLQNGILNTFVLLYIKEILCSEEMSVLMSCPAPLKNRDFVLQRSWLDTGDEKMILNHSVYHKDYPPRKGFVRALSLLTGFVVRRRNGPGSWLGYVSRSDPRGALPAWLVNRVTAQLAPRLVHQLHAASRRYPGWKALTDTPYYQPWRNPEQVPPYRINLEDCIDPEAPPPPVEQKPNTSKTKLEVPQVKEVESHSIEDLGDISSEFSVEVEELADLTLDPTKKKGKFYKLVKSMKNRRKSVQVAKSADNLVDAKPTEVEKTEKRKSSFKFHRRFSLSRGRED-