Monarch geneset OGS2.0

DPOGS213376
TranscriptDPOGS213376-TA1872 bp
ProteinDPOGS213376-PA623 aa
Genomic positionDPSCF300109 + 202234-206189
RNAseq coverage661x (Rank: top 19%)
Annotation
HeliconiusHMEL0163829e-14963.19% 
BombyxBGIBMGA009122-TA0.064.90% 
DrosophilaCG11123-PA4e-11134.76% 
EBI UniRef50UniRef50_B4KND52e-11136.35%GI18760 n=3 Tax=Drosophila RepID=B4KND5_DROMO
NCBI RefSeqXP_002006053.13e-11236.35%GI18760 [Drosophila mojavensis]
NCBI nr blastpgi|1951231116e-11136.35%GI18760 [Drosophila mojavensis]
NCBI nr blastxgi|2700106312e-11537.24%hypothetical protein TcasGA2_TC010059 [Tribolium castaneum]
Group
Gene OntologyGO:00054882e-42binding
KEGG pathway 
InterPro domain[56-604] IPR0160242e-42Armadillo-type fold
[335-414] IPR0119898.7e-23Armadillo-like helical
Orthology groupMCL13859 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213376-TA
ATGGGAGAAGAATCTGTTGATAATAATCCTCTCAAAAAGAAGCAGAGGAGGAAACGAAAGAATTTCTTGTCCAACGCCAAAAAATACGCTAAAAAAGGACAAATGGGTAGAGGGACTAAGATCCCTGAAGAACTTTATCAATATTTTGTTGGTATTTTGGATGTCATCAAACAAGGAGTGGAGAATCCGGAAGAGAGAGAAGCCCTTGTCAACAATGTTTTGGAGAGAACTAAAGGGGAAGAATTAAATGTAGTTGGTAACCAGCTGGGATGTCGTATGGTGGAGATCCTCCTACCCTATTCTTCAGCTGAAGATCTGGAACGTTATATCGAGGTACTATCACCGGAACTGCGACCGTTATGTTCAGACAACATTTCAAGTCATGTGGTGGAAACATTACTTAGAGTGGCCTGCGAGAGAGCTACAGAGCATTTACAAATAAGTGATGAAGAAAAAGTTCCCAAGAAAAAGAAAAAAGTGGAACCCAAGTACAAAGACGACCACATAAAAAAATGTAATGAATTCACATTGAAGATATGTAAATATGCACTAAATAATTTGGAAGACTTCGTCTGGGATAAATATGCCAATCATATTTTACGCAGTGCTATCAAATGTCTCAGTGGCATAACACTGCTGCCGGGAGAGAAACCTAAAGTTAATATGTTCAGGGAAACAGTAAACCAAAATAAAGGAATACCTCCGCATCTGACGGAGTTGAAGTACAAAGAAGTTCCTGAAGAATATAAAGACATTGTTAAAGAATTCGCTAACAGATTGTCTTCGTGGCCTCAATTTAAGGACCTGCCGAATCAAAACATAACATCGGCATTGTTACAAGTGTTATTATACGCTGTTAAGAATGTCGATGCAAATATATGCAAGAGTTTAGTCAACAAGTTGCTGAACGAAAGCTTTGCTCCAGAAGATTGGGTTCCGACGGGTGAAGATGAGAAAAAAGATCAAAAAGATCAGAAGACTGTTGACAATGAAGCTGATGTTGCATCATCAGACTTTGATTGTATATCAAATCAAAAAGGGTCCAATAACCTCCGTGCGTTATTCGTATCAACTAAGAAATCATACACACAGATATACGCCCGCTGCTTCATCAATCGCCTCGCTAAACTCTGCACTATGTCCATGTTGAATTTCACCGTACAGAGACTCATAGACAACTGTCAGATCAAAGAGGAATTTGAACCAATGTTTGAAGAGCTATCGAGTCAGTTCAACTCCATACTGTCGTGTGGAAACACTGGTGTGTTGGTGGCTCTAGCTAAAGGATGTCTGCGGGTCAAGGCTAAGCAGATGCAGATGATGAACATGTTGGAGACGGTCCTCAACTGTTCGGAAGCTGAACCGAAGCGTTTCATAATCCAATGTTTGCGACTGGTTCCTCTCAGTGACAAAACAATGGACACTACCAATTATTTTATACACGTCCATGGATCAGTGATAGTTCAGACGATGCTCGGATTCCAGCGTCCCGCCAAGCTGGTGTCCAGTATACTGGAACTGTCCTCGGAGGAGTTGGTTTACATTCTGTGTGATCCCAAGGGCTGCCACATCGCTGATGCCTTCACCACACAGAATCTGGTTGGAGTAAAGGCCAGGGATAAAATGGTGTGGAAGTTGAAGGGCTACTATCAAAACATGGCTATATCTCAGTACGGTTCGCGAGCTTTCGAACAGGTCTTCGAAGCGGCATCCATGGAGCAGAAAGTGAAAATAATGAAGGAAATGTCAGACAAGAGCAACTTACTGAACACCACCAGCTACGGACGACTTATAGCCACCAAACTTGATATCAGTACATTCAAGGCTTCACAGAAGAAGTGGGAACAGAGCAGGACCGAGAAACCTGAATAA

Protein sequence:

>DPOGS213376-PA
MGEESVDNNPLKKKQRRKRKNFLSNAKKYAKKGQMGRGTKIPEELYQYFVGILDVIKQGVENPEEREALVNNVLERTKGEELNVVGNQLGCRMVEILLPYSSAEDLERYIEVLSPELRPLCSDNISSHVVETLLRVACERATEHLQISDEEKVPKKKKKVEPKYKDDHIKKCNEFTLKICKYALNNLEDFVWDKYANHILRSAIKCLSGITLLPGEKPKVNMFRETVNQNKGIPPHLTELKYKEVPEEYKDIVKEFANRLSSWPQFKDLPNQNITSALLQVLLYAVKNVDANICKSLVNKLLNESFAPEDWVPTGEDEKKDQKDQKTVDNEADVASSDFDCISNQKGSNNLRALFVSTKKSYTQIYARCFINRLAKLCTMSMLNFTVQRLIDNCQIKEEFEPMFEELSSQFNSILSCGNTGVLVALAKGCLRVKAKQMQMMNMLETVLNCSEAEPKRFIIQCLRLVPLSDKTMDTTNYFIHVHGSVIVQTMLGFQRPAKLVSSILELSSEELVYILCDPKGCHIADAFTTQNLVGVKARDKMVWKLKGYYQNMAISQYGSRAFEQVFEAASMEQKVKIMKEMSDKSNLLNTTSYGRLIATKLDISTFKASQKKWEQSRTEKPE-