Monarch geneset OGS2.0

DPOGS209574
Transcript: DPOGS209574-TA, 2040 bp
Protein: DPOGS209574-PA, 679 aa
Genomic position: DPSCF300015 - 1107303-1109960
RNAseq coverage: 669x (Rank: top 19%)
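
The lengths and coordinates listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the transcript consists of coding sequence only and ends in a single stop codon, and that the genomic coordinates are 1-based and inclusive. The variable names are illustrative.

```python
# Values copied from the record.
transcript_bp = 2040
protein_aa = 679
genomic_start, genomic_end = 1107303, 1109960

# Assumption: the transcript is coding sequence only, ending in one stop codon.
expected_cds_bp = (protein_aa + 1) * 3

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = genomic_end - genomic_start + 1

# Sequence inside the genomic span that is not part of the transcript
# (for example introns, if the span runs from start codon to stop codon).
non_transcript_bp = genomic_span_bp - transcript_bp

print(expected_cds_bp == transcript_bp)  # True
print(genomic_span_bp)                   # 2658
print(non_transcript_bp)                 # 618
```

Under these assumptions the 679 aa protein plus a stop codon accounts for all 2040 bp of the transcript, and the genomic span is 618 bp longer than the transcript.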
Annotation
Heliconius: HMEL017048, 0.0, 67.98%
Bombyx: BGIBMGA006634-TA, 2e-166, 55.42%
Drosophila: CG7744-PA, 2e-38, 27.48%
EBI UniRef50: UniRef50_Q17AJ3, 8e-37, 38.64%, Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q17AJ3_AEDAE
NCBI RefSeq: XP_001650619.1, 1e-37, 38.64%, hypothetical protein AaeL_AAEL005281 [Aedes aegypti]
NCBI nr blastp: gi|157109315, 3e-36, 38.64%, hypothetical protein AaeL_AAEL005281 [Aedes aegypti]
NCBI nr blastx: gi|211938663, 4e-38, 25.91%, FI09618p [Drosophila melanogaster]
Group
KEGG pathway 
InterPro domain: [38-150] IPR018617, 3e-24, Protein of unknown function DUF2349
Orthology group: MCL25261, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209574-TA
ATGAAAAATCCCTATGTAATGGATTCCGCACTTATAATAGTACCTTCTGTCCTATGTTCTTTTTTAGTATTTAAACTAATATTAAATTTGCGGTCAACACTGCCTTGGCGAGTCAATTGTTGGTTTTGTAATTCAAACTTTTGGATAAAATACATTGAACGTAACAGCTGGACCTGTAAAAAATGCGAACAGTATAATGGATTCACTAAAGACGGTGATTATAATAAAGTATTAAACATAGAGAATGAGAAAGTTTCATTAAGTCCTAAATTATTCCAAAGAAGTCCACCCAAAAATGGCCTCTGCAAAATGTGTAATATTAACCAACAGCTCAAGGTGACACAATTATCAAATTTTGTTCCTATGAATGAAAATAAATATGATGAGGAGGTAGAATGTTACAGATTAAAATTAGAAAAAGCTTATAAATTATGTAGTCCATGCAAAAAAGTGCTACAAATGAAATTGCACAAAGAAAAAGAGACCTTATTGGGTTCGAAGCTTTTAGAAATGAGGACACCAGAGAAAAAACTCCAGAAATCAGCAAAGAGAAATGAATTTTGTAAAAACGTTATTAACAATGCATCCACGCTGATAGCAGGAGTGCTTATAGTATTAGTGACGGTTGAATGCTATGAAAATGCTTTGAAACACAAGAGTTTATCATATACATTAATTTATATTAAAGATATGTTGAACAATATATTACAAAGAATAATCTCTATAGTCAGAATGAAGACATTCCTCACATTCCCATCTCTAGAGAAACATTTTATGGTCGTAGATTATAACACAGATATATTTACAAACTTTAACAGCGGACTTAATGATTTGACTCAGAAAGCTTTAGGTGGCTTTGTATGTTTTATACAGATTGTTGGACATTTTTGGAATATAAATATATCGCATCACAGTGTTGCCATTGACTTGTTGTGGTCTGTATTTGTTATCACATCGATTGTATATGGGAATGTAGATGCGGACCCATTAATTATGAGCTTATTAAAATTATCGAGTACTCTGGCAGTATTGCTTATTTATATTAACATGAAACAAAAGAACTTAAGAAATGTGACGAGAAAAATTAACACACCAAAGAGCGGAACAAAATTACTACGAGGTACAAAGAGTATTAATGACGAAGAAGACAACATATCTTTGGATACAGACGACGATGTGTCTCTTAGTAAATTTGGTTTACACAACTTCACGGACTCATCGAATGATACACTTAATCCTTTGAGTGGTTTGATAAGTGGCAGATCCTTCACTCCCAGGAGTGACAGTCTTTGGAGTAAACCTAACATAAACACAGCTTTCACAGTAAACTCCGCATTGACTAACACACCAAAAAGCATATCAGATTCAGTTTTCATGAAGCCATCGTTCAACAAATATCAAAAAATCAAAGACGAATCCGATTCCGATCTAGACGAAAGTATAAATTCATTGTGCATAAGCAGTCCCAAGAAGAAGAGCCGGAAAATTAACCCGGTGTTCGCTCTGAGAAAGTTCACTGCGTCTCCAAATTTTATAATTCCGACACCCCAAAATCATTCGAGGCCATTGATATCACCGAGCAAATTGGGCCACAGTACTTCCTGGGTGGCGGGGGGCTACTGGGGCAATGAAGGCGGACAAATATTTAACGTGAACGGAAGCCGGTCTTCCAGTCAAAGTTCAGGGTTCGAGTCCCAAGCGTCGAGCATGAACCAACGAAACGTGTTCTCTCCGCCGTCGCGCGAGGAGTCCGTGTGCGGTGAACCAGAAAGAACGCTCTTGATGGACCGCTTCAGTAACAACACCAGCACCATGAACGGCTTCAACTACCCGAACCCGAGCTTCACTCCCATCGCGACGCCAGTCTTCCCACAGATGCAGTACAACAGTCACGTACAAATACCTCAGCCGAGATTCGCTCAGCAGACTTTCGTGTCGCAGAACGTTTTCGCCCAACAGTTCTCTCCCAATTCCACATTCAAGGCCCCCAGCGGCTCTCG
TCTGATAAAGCTGCCCACAGACAGCTTCGCTCCCCGCTGA

Protein sequence:

>DPOGS209574-PA
MKNPYVMDSALIIVPSVLCSFLVFKLILNLRSTLPWRVNCWFCNSNFWIKYIERNSWTCKKCEQYNGFTKDGDYNKVLNIENEKVSLSPKLFQRSPPKNGLCKMCNINQQLKVTQLSNFVPMNENKYDEEVECYRLKLEKAYKLCSPCKKVLQMKLHKEKETLLGSKLLEMRTPEKKLQKSAKRNEFCKNVINNASTLIAGVLIVLVTVECYENALKHKSLSYTLIYIKDMLNNILQRIISIVRMKTFLTFPSLEKHFMVVDYNTDIFTNFNSGLNDLTQKALGGFVCFIQIVGHFWNINISHHSVAIDLLWSVFVITSIVYGNVDADPLIMSLLKLSSTLAVLLIYINMKQKNLRNVTRKINTPKSGTKLLRGTKSINDEEDNISLDTDDDVSLSKFGLHNFTDSSNDTLNPLSGLISGRSFTPRSDSLWSKPNINTAFTVNSALTNTPKSISDSVFMKPSFNKYQKIKDESDSDLDESINSLCISSPKKKSRKINPVFALRKFTASPNFIIPTPQNHSRPLISPSKLGHSTSWVAGGYWGNEGGQIFNVNGSRSSSQSSGFESQASSMNQRNVFSPPSREESVCGEPERTLLMDRFSNNTSTMNGFNYPNPSFTPIATPVFPQMQYNSHVQIPQPRFAQQTFVSQNVFAQQFSPNSTFKAPSGSRLIKLPTDSFAPR-
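
The protein sequence above corresponds to a codon-by-codon translation of the transcript. The following is a minimal sketch of that translation, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), which the record itself does not state, and it writes the stop codon as "-" to match the record. The function name `translate` and the two test fragments (the first 30 and last 12 nucleotides of DPOGS209574-TA) are chosen here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate an in-frame nucleotide string; unknown codons become 'X'."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten codons and last four codons of the transcript.
first_codons = "ATGAAAAATCCCTATGTAATGGATTCCGCA"
last_codons = "GCTCCCCGCTGA"

print(translate(first_codons))  # MKNPYVMDSA
print(translate(last_codons))   # APR-
```

Both fragments agree with the start (MKNPYVMDSA) and the end (APR-) of the listed protein. To translate the full transcript, join its sequence lines into one string before calling the function.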