Monarch geneset OGS2.0

DPOGS200894
TranscriptDPOGS200894-TA2283 bp
ProteinDPOGS200894-PA760 aa
Genomic positionDPSCF300066 - 389521-402073
RNAseq coverage481x (Rank: top 26%)
Annotation
HeliconiusHMEL0122240.074.74% 
BombyxBGIBMGA000545-TA0.075.16% 
DrosophilaCG12084-PA0.051.22% 
EBI UniRef50UniRef50_D6WI510.053.25%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WI51_TRICA
NCBI RefSeqXP_001850545.10.054.94%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700459350.054.94%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1582950780.055.05%AGAP005961-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00054885e-30binding
KEGG pathway 
InterPro domain[379-748] IPR0160245e-30Armadillo-type fold
[384-731] IPR0119892.8e-19Armadillo-like helical
Orthology groupMCL14522 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200894-TA
ATGTCGCTACTTCCTGATGTGTTTGCTTTGAACGACAGTTCACCTTCTAGTTTGTTTAACATAAGTATGGACTACATCGTGTCCAACTTAAATGTTATTACCAAATTGGATCCAAAAAGTCGATGGCGCCAACTGAAAGATGACATCATATTGCCTGCTGAAATCTGCGAAAGACTACTTGAGGCCTATCAGAAGAAGTATCGAATTAACGATAACATTGCAAATATGTTTCGTGACCGCACCAGAACGCGTTTAAACCGCGTGAAATTGCGTAATTCTCGCATTACTGATGAAGGTCTCCGTTGCTTGATGGAGCACAAGCCTTCCGAGGTCGAGCTGATTCAATGCGAATACCTTTCCCAAGCTTCTCTCGACATCATTAGCAACAACAGTGAAAATCTAGTCTCTCTCAAGTTTGGGCCACTCACTTATGCACTTTCTCAAGATGAAAATCTGTATCGTCAACGCAGATTTGTCATTGATGCTCCCAAACTGCAACGTTTGTCTATACATTGCCGAGGAATGTCAATCTTTCCTCTTTTACTTCTGAAACCTCTGCATAATCTAACACATCTTGATCTTTCTGAATTCTCATCTGCCGGTTCCACCTGGGCTCTGCATGAATTAAAGAATTTAGTAACGCTTGTTCTGCATAGTGTGCTTTGGTCAAAGGAAATCATTGATTGGATTGCTACATTGACTAATCTACGCCATTTGGATGTATCCCAACCAAATGAGAGACATGGGAAATATATGAACCCCAATGAAATATTGACTAAGTTGGTGACAAGTTTGCCTAATTTGGAATATTTGGATATATCTGGGACAAATTTGGCTGGCACTGGCTCAGCTGTTCCAGCTGCTACTGATGGAATGTCTGACTTGGAACATGTCCGTTGTGATATACCGGGCCTTGTTAGTAGAGTTGATAGGCCTTTAGAATTTCTGGGCCTGTATGGAACTCTTCATGGTGCGTGCAAACGTCATGACATACCAGCCAAAGTGATCACTGGAGATGCAAATGAAGAGCAGATCCTGATAGCTGGTTCTGTGTACATGGAGAGACCGGCAATGCTGACTCGTGTCCTCAACGACTTATACTACATGTTCCGCAGTATTGAACACTATAACGCATATGTTGGACGCGCTTTGAATGTCGTCTTGACAGCAATGGAGACCCATTGCTTGGAGAAACATATTCAGATATCTGGAAGCGCAACACTATTTTATATAGTAAAAGGGAAAGATAGATCTCGTATCGGCATACATCTGAAGCGGCGTATCATCAAGGTGTTGTTGGATGGTATGGAAGCTCACCTCGGCGATGATACTATGATGAGGAATGGCTGTCTCACGCTCTGCCAATTCAAAATACCCAATGATGTGTTGTTTGAATACGAGAGGGTAGTTCAGATTTTACTGAACGGTGTATCTGACGTCAATCAAGAGGGATTCGTGCAACGTATAGCGATATATTTGCTAAATTCGCTCGCCTGTCAGGTCGATGGCACTCAGAAACGCTTTCTGGGAGTCCACGGAGCTATCGGGAAAATGTTGAATCTGATCGCTGATCGTCTTGAGCGTCGTATCTGTGACGATGTTCTCGAAGTAGCCTGGTCTACGATGTGGAACGTTACAGATGAGACTCCTCACAACTGTCAACTGTTCCTCGATAATCGCGGGATGGAGTACTTCCTCGCTTGTCTCAAGAGTTTCCCCGACAAGGAGGAGCTTCTACGAAACATGATGGGGCTTCTAGGGAACGTCGCTGAAGTCCCCTCCCTCCGCCCACAGCTCATGAACAGATTGTTCCTGACTGTATTCTACGAACTGCTCGAATCTTCCAGTGATGGTATTGAGGTGAGCTACAACGCGGCCGGTGTCCTAGCTCACATGGCGTCCGATGGTCCCGAGGCTTGGACTGTGTTCGAACCAGCTCGTGATGCTGTACTGGAACGTATCGCCACAGCCGTTGAGAGCTGGGACCGTAGGGCTGAACGGAATATTAATTATCGTTCGTTCGAACCTATACTCAGTCTGCTCCACGCTCATCACACGCCGCAGTGCCAACACTGGGCTGTTTGGGCGTTGGCCAACCTGACAACTGTATACCCCGACAAGTACTGCGGCTTGGTGGAGGCTGAGGGTGGGCTTCGTCTGCTGAATGCTCTGCTGCAACATCCCGGTCCATACAGCATGATAAAGGACCTGGCGCGCCTCGTCATAGACAACTGCGCCCGCCACTCCCAGGATGCCCCACGCGCGGACACACCGGACAATTAA

Protein sequence:

>DPOGS200894-PA
MSLLPDVFALNDSSPSSLFNISMDYIVSNLNVITKLDPKSRWRQLKDDIILPAEICERLLEAYQKKYRINDNIANMFRDRTRTRLNRVKLRNSRITDEGLRCLMEHKPSEVELIQCEYLSQASLDIISNNSENLVSLKFGPLTYALSQDENLYRQRRFVIDAPKLQRLSIHCRGMSIFPLLLLKPLHNLTHLDLSEFSSAGSTWALHELKNLVTLVLHSVLWSKEIIDWIATLTNLRHLDVSQPNERHGKYMNPNEILTKLVTSLPNLEYLDISGTNLAGTGSAVPAATDGMSDLEHVRCDIPGLVSRVDRPLEFLGLYGTLHGACKRHDIPAKVITGDANEEQILIAGSVYMERPAMLTRVLNDLYYMFRSIEHYNAYVGRALNVVLTAMETHCLEKHIQISGSATLFYIVKGKDRSRIGIHLKRRIIKVLLDGMEAHLGDDTMMRNGCLTLCQFKIPNDVLFEYERVVQILLNGVSDVNQEGFVQRIAIYLLNSLACQVDGTQKRFLGVHGAIGKMLNLIADRLERRICDDVLEVAWSTMWNVTDETPHNCQLFLDNRGMEYFLACLKSFPDKEELLRNMMGLLGNVAEVPSLRPQLMNRLFLTVFYELLESSSDGIEVSYNAAGVLAHMASDGPEAWTVFEPARDAVLERIATAVESWDRRAERNINYRSFEPILSLLHAHHTPQCQHWAVWALANLTTVYPDKYCGLVEAEGGLRLLNALLQHPGPYSMIKDLARLVIDNCARHSQDAPRADTPDN-