Monarch geneset OGS2.0

DPOGS209005
TranscriptDPOGS209005-TA1956 bp
ProteinDPOGS209005-PA651 aa
Genomic positionDPSCF300209 - 102473-104528
RNAseq coverage0x (Rank: top 97%)
Annotation
HeliconiusHMEL0025405e-14656.83% 
BombyxBGIBMGA012553-TA1e-11252.07% 
DrosophilaCG10979-PA8e-0834.48% 
EBI UniRef50UniRef50_E9JD441e-0826.15%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9JD44_SOLIN
NCBI RefSeqXP_002013775.18e-0935.42%GL24323 [Drosophila persimilis]
NCBI nr blastpgi|3227789125e-0826.15%hypothetical protein SINV_15618 [Solenopsis invicta]
NCBI nr blastxgi|1948826911e-1421.71%GG25360, isoform A [Drosophila erecta]
Group
KEGG pathway 
Orthology groupMCL19882 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209005-TA
ATGAGTACAGTTGATGTGAAACAACAAGAGACCGAGGATTACAGAGATGACGACACTCAAGACGTGCCAGCTGGGAACACTCTACAGTATTACACGAATACGCTTATAAATAAATTACCATTCGCCCAACAAGCGGAGAAGAGCAATCTGAACGCGTTCAAAAAGAGATTACAATCAGATGTCGAAATAGATCAACTTTTATGTAAAAAATGTAATAGTAAATTTGAACAAATAGGTGAATTATTAGAACACGTCGCTGGACATTATAAATGGTTGCGCTACGCCTGTAAACTTTGCAACTTCAAGCATTTCAACTTTGATAAACTCCCGGAACACGTTAAAGTTGTCCACAAACTCAAAGGCGATACTGATTTCTACTATAGTACCGTAAAAGCCATAGACGGTTCGGAAGCCAGCGAACTATCTTCCCCCGTGGAAGAATTAACCGAATCTAATGAAACTAGTCCAGATTCACGACGTCCAAGCAGATGTTCTAGTGACTCCAGCAGATTATCTGACGATAGCTCCTCCAGCAGTACACGAGTCGAAACCGGTTCGAGAAAACGCAAAGCACGACTGGTCAAAAACATCGGAAAGAAGAAAAAGGATACTGTTGTTATAGATGACAACGAAGAAAGTAAAGAGGTTATGCATAAAGGAGTTTTGTTAGGAGAAAATGATTCGTCCTCCAATTCAAAAATATTCGAAGAAAATTCATCAGATTTGGATGAAGTTGATGAGAAAATAGCAAAGCGCGAAAACATGACATCCGTAGCATGCCGTAGACCAGTTCGTAAGAAAACTAAACGCAAGAACGAAGATTTCGAATACGATCTGTCGAATTTGTTAAAAATGGAAGCGCAGGGCTATCGCGATTCACAAGTCACACCAAAAACTGCTCCTTCTAAGAAGAAAGTACAACAAGATGTTAACCCTCAGTACGAGCTCATCAACAAAGAGTGTTGTGGTGCACTAGTGACGATGTCGAGGTCCTCGGTAGAAAAAGCTCAAGCCCATATGAAGACTGCAACCTTTGCTGTGTTTAACACTTCAAAAGAACCTCGTGTATCAAATATTTTTGTGAGGCCTCTGGTGCCTAAAATTAATAGAGTAGATAAAATATCGCCTAAGAAGGCTGAAAATGAAGAAACAAAAGAAATCTCCCACCCTAGTCCCACTAAAATAATAGACGCCTCCACTCTATCAAATCTCTGTAAGGAATTGGTGATAACTAAAGTTGTAAATAAAAAATGTGAGGAAAAAGAAGCAAATGTATCCGCTAATGAAACTCCAAAAGAAACCGAGCCGATACCTCAAGTAGATAATAAAACGGTCAGCGACGACAGTAAAGAGAAAAAGGAAGAAAAGAATAAGGTATCTGAAATAGAAGCAAAATCTGACGAGAGTGCCTCATCCGAACAAACTAAAACTAATGTGAATGTACCAACAATACTTCCTATAAAATTCCGAAGACAAAGTTTGGAGGTTATACAAAATCCCTTAATAAAGAAAAATATCACAGACTTCACAAAAGCCGGTATGAAAACTAAAATTTTGGTAATCAAACCCATCAATAGGAGCACCGATGGAACAAAAACACTGAAATTTCAAACAATAAAATTGAAAGATCCGAACAAGACCACCACGAAAAATGATGAAATGAAAACCGAACAGGTCGTCGTTGTGAAAGTTCCCAAAGTGGATTGTTCTATAAGCAGATCAATACCAGCCAGCGACGCCCCTGTGGCACTCGACGAGAAATGTGATGAGAATGAAAACGAAAAAGTTAAAACGAATGCTGCAAATCCATCAAATCCTACCGGTGAAAACAGTGTGGAAGAACCTAAAAAAGACATTAAAATAGAAAATGACATAACTGACTTGGTGGAAGACAAACCCGAATCAAAATTAATAGAATGTATAGAATTGGAAGAGGCCGTGATGCAATCTGGTTGA

Protein sequence:

>DPOGS209005-PA
MSTVDVKQQETEDYRDDDTQDVPAGNTLQYYTNTLINKLPFAQQAEKSNLNAFKKRLQSDVEIDQLLCKKCNSKFEQIGELLEHVAGHYKWLRYACKLCNFKHFNFDKLPEHVKVVHKLKGDTDFYYSTVKAIDGSEASELSSPVEELTESNETSPDSRRPSRCSSDSSRLSDDSSSSSTRVETGSRKRKARLVKNIGKKKKDTVVIDDNEESKEVMHKGVLLGENDSSSNSKIFEENSSDLDEVDEKIAKRENMTSVACRRPVRKKTKRKNEDFEYDLSNLLKMEAQGYRDSQVTPKTAPSKKKVQQDVNPQYELINKECCGALVTMSRSSVEKAQAHMKTATFAVFNTSKEPRVSNIFVRPLVPKINRVDKISPKKAENEETKEISHPSPTKIIDASTLSNLCKELVITKVVNKKCEEKEANVSANETPKETEPIPQVDNKTVSDDSKEKKEEKNKVSEIEAKSDESASSEQTKTNVNVPTILPIKFRRQSLEVIQNPLIKKNITDFTKAGMKTKILVIKPINRSTDGTKTLKFQTIKLKDPNKTTTKNDEMKTEQVVVVKVPKVDCSISRSIPASDAPVALDEKCDENENEKVKTNAANPSNPTGENSVEEPKKDIKIENDITDLVEDKPESKLIECIELEEAVMQSG-