Monarch geneset OGS2.0

DPOGS202530
Transcript: DPOGS202530-TA, 1362 bp
Protein: DPOGS202530-PA, 453 aa
Genomic position: DPSCF300131 + 409304-410702
RNAseq coverage: 109x (Rank: top 59%)
Annotation
Heliconius: HMEL007433, 1e-124, 51.32%
Bombyx: BGIBMGA001554-TA, 7e-16, 55.00%
Drosophila: CG32075-PA, 1e-10, 38.27%
EBI UniRef50: UniRef50_E9ILT0, 6e-30, 26.67%, Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9ILT0_SOLIN
NCBI RefSeq: XP_001957865.1, 8e-18, 24.11%, GF23807 [Drosophila ananassae]
NCBI nr blastp: gi|340722338, 8e-30, 26.12%, PREDICTED: hypothetical protein LOC100646356 [Bombus terrestris]
NCBI nr blastx: gi|322795831, 3e-36, 26.46%, hypothetical protein SINV_14412 [Solenopsis invicta]
Group:
KEGG pathway:
InterPro domain: [1-68] IPR007174, 9.4e-20, Las1-like
Orthology group: MCL17948, Insect specific

Nucleotide sequence:

>DPOGS202530-TA
ATGCGTTTTGTGAACCATATGCTGGACACACAGACAGCTAAAGGGCAGAGTTTATTCCAGGCTGCTAAAAATTTAAATATCCCAGAATGGATAATAGATATGCGGCATGACACCGCTCATGGTAATAAGCTCCCCCAGATTGAGCTATTAAGAGAAGCCTGTTTATTAAGTTTAGAATGGCTTAAAAATTATTATTGGGATAAACATAAAGAATATATTGGAGATTATATTGTTGGTCAAGTGGCAATGCAAAGGAATGAAAGGGAAAACAAAATTGAAGCACTGATAAATTTTTGTGTATCTCTGAGTATTTGTTCACATCCAGGATGCAATGTTAGGACCATTGCTGACGTACCCGATGCCACTTTGAGAGAATCTTTAGTTAATGATGCAAAAGAAATTATAGGTGATAGTTTTGACTATTCCAACTTAAAAACTATCACAATAAAAAATCTACATCATTTTCTAACACCTACGGTAAAGAAAATACTTTATCATGACTCGGCCTGTACTTATGTTAATAAGATTTTATTATCTGAAAACTCTCTGTTTCTATCATTGGAACTCTATGTTCTTCTTGATAAATATAATTACCAGCTGAAAAAACAACTCAATAAAGCCTTTGTTCAATGTTTTGAAATGCTTCTACAATTTCTCCACACTTATGACTTGCTACAAGATTTCTTAATTGAATTAATAAAGAGAGTACAAAGTAATGAAATGTGTGACAAGAGACGGAAATTGGCTGCCCTTTGGGTGTCAGTTATATTGAAGGCTTTAGGGAAAGTTCAACTATTTCAGGAACAGGTTATGAAGAAAGCTTCAAAGGAATCAACATCTAGATCGAAAGACTTGAAATCTCTATTTTATCACTGGTTTCCAAATGAAAGGAACTGCCACCTATTATTAGATTTAAACAAACCGGTGCCAAAGGATTTAACTAATATAAATTTTATTCAACCTATAATTTCTACATACAATGAATATCTAACATGTTTCATCAAGGATCTCTTGACTTTGGTGAGACCCCAATTACCCCAACCAGTTATAAAGAAGCTGTGTGAATTGGCAAAAGCGATATCATCACCAAAAAAAATCAAAACAACCTCTAAAATATACACAGTAGATGATCTCGATTTGAATGGAGATACACTGAAAGATGAGTCTGTTATTGTGATTGATGACAACCCTAAAGATGTCGTGACAACCCTAATCACTGAGAGTGAACCAAAATGTGTAAATAAACAGAAACATGGAGTTTTTTATCTGGCTTCTCATGAGCATGCTTGGGCAACTTGCCCGATAGGTTTATTGCCATGGCAACAAGCACCAGCTGAACAAATGGATGTAGATATAAATTAA

Protein sequence:

>DPOGS202530-PA
MRFVNHMLDTQTAKGQSLFQAAKNLNIPEWIIDMRHDTAHGNKLPQIELLREACLLSLEWLKNYYWDKHKEYIGDYIVGQVAMQRNERENKIEALINFCVSLSICSHPGCNVRTIADVPDATLRESLVNDAKEIIGDSFDYSNLKTITIKNLHHFLTPTVKKILYHDSACTYVNKILLSENSLFLSLELYVLLDKYNYQLKKQLNKAFVQCFEMLLQFLHTYDLLQDFLIELIKRVQSNEMCDKRRKLAALWVSVILKALGKVQLFQEQVMKKASKESTSRSKDLKSLFYHWFPNERNCHLLLDLNKPVPKDLTNINFIQPIISTYNEYLTCFIKDLLTLVRPQLPQPVIKKLCELAKAISSPKKIKTTSKIYTVDDLDLNGDTLKDESVIVIDDNPKDVVTTLITESEPKCVNKQKHGVFYLASHEHAWATCPIGLLPWQQAPAEQMDVDIN-
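
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the transcript length counts the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
# Figures copied from the record above.
TRANSCRIPT_BP = 1362
PROTEIN_AA = 453
GENOMIC_START = 409304
GENOMIC_END = 410702


def coding_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals 3 * (residues + 1 stop codon).

    Assumption: the transcript is coding sequence only, ending in a stop codon.
    """
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Return the length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    print("CDS length consistent with protein:",
          coding_length_matches(TRANSCRIPT_BP, PROTEIN_AA))
    span = genomic_span(GENOMIC_START, GENOMIC_END)
    print("Genomic span (bp):", span)
    # Any excess over the transcript length is sequence that is not in the
    # transcript (for example an intron), under the assumptions above.
    print("Span minus transcript (bp):", span - TRANSCRIPT_BP)
```

Under these assumptions the 1362 bp transcript matches 453 residues plus a stop codon, and the genomic interval is slightly longer than the transcript.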