Monarch geneset OGS2.0

DPOGS208213
TranscriptDPOGS208213-TA1569 bp
ProteinDPOGS208213-PA522 aa
Genomic positionDPSCF300179 + 255298-258306
RNAseq coverage572x (Rank: top 22%)
Annotation
HeliconiusHMEL0032317e-11478.19% 
BombyxBGIBMGA002260-TA1e-14654.64% 
DrosophilaCG9986-PA7e-9137.58% 
EBI UniRef50UniRef50_E2BIG57e-12444.67%Uncharacterized protein C12orf4-like protein n=8 Tax=Formicidae RepID=E2BIG5_HARSA
NCBI RefSeqXP_394347.37e-12243.04%PREDICTED: similar to Protein C12orf4 [Apis mellifera]
NCBI nr blastpgi|3072065272e-12344.67%Uncharacterized protein C12orf4-like protein [Harpegnathos saltator]
NCBI nr blastxgi|3072065272e-11844.20%Uncharacterized protein C12orf4-like protein [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[42-507] IPR0193116.5e-113Protein of unknown function DUF2362
Orthology groupMCL11781 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208213-TA
ATGACCGCTACGGAGTTAGAGAATGCTACAAAGACTTTTAAATTTTCTTTTCCAACATGCACGAACGAAGATCTGTTATTCAAATTAGAAGTGCCTGTGGAGATACCTTACCCAGGCTCCACGAGAGAGCTGGTGCAAAGAATATTAAAAATGTTCCACATTCCAGTATATTTGGAAGACGAGCTTAATGAGAAATTGGCTGATTTTGTATCAGAAGAAACAAGAAACTTTCACCACAATCGTGATGCTACACTGATTGACCAATTGAAGAACAATGAATTAGATTTGGAAGGTATTATCAAGAATTGGGAAAAGCAATTTAAAAATGTAGTTGACTTTGCTGAACAGAAGGGATCTTCCGACGAGGAGGTATTTGCTGCTGCTTACCACAAGCTGGTGCACTCCCCGGCTCTGGAGACTATACTACAAGTAGAGAGCGCTTATGCGAAGACTGTCATGGACATGATTCAGAACAGGGATGACGACATCAGGAAGCTTACAAAGAGGCAAACAGAAGAAATGGAAGAGAAAATCCGCCTTCTGAACACTTCCACTACAGAAGAGGAAATCAATACATTAGCAGCTAAGCATTTTGAAGCTCAGAGCCTTGCAACAGGTCGGTGGGACTCACAGCTGGATGCCTTGAAACACACACAGAGAGCAGAGCACCGCACCTGGCTCATGAATGCTATCAATGAATATCAGACTGAGGAGAAAATTACTCCCAGCAACTCTCCCCTGTGTTCGTACGCGTCGCTGCCACCCGCGCCGGCCGCTCCCGCCACCCTGCTGGAGGAGAGCTTCACCATACACCTCGGCTCTCAACTCAAACAAACACACAACATCAGGCTCGTATGCGCAGACATGTTAGACCTGTGCGCGCGAGACAGGACCGACGGTGGCCTGTCCCTGAGTCTATACTCGAGCGAGTTGTCGGGGGCGGTGGTGGTGTGCGAGGGTCGGCCGTCCCGCTCACCCTTGACCAGTCTGCCGCGAGTCACCGACCATCACTTCCCCGACTTGCACGACCAGCTGAGACGAATAGAGGAGGCGGTCGCCGACCCGGCGGAGACACGCAACAGAAGCCGCGGCGAGCGCGAGTCGCGGCGGCGGGCGCTGCGAGCGGGGGACGTGTTCGTGACGAGGCACAGCAACCTGTCCCAGCATGTGGTGTTCCACCTGGTGGCGGACGAGGACGAGCTGCGCTCGGCCGAGCTGAGCTCGCGGCACCGCGCCGTGCTGGGGCTGCGCGAGGTGCTGCTCGCGGCGCAGCGGAACGACGTAGCCAGCGTGGCGCTGCCGCTGCTACTGCGGCGCGAGCTGGGCGAGGATGCCACGGCTGCCTGGTGCCTGCGTCGCGCCGAGCTCGTGCTCAAATGCGTCAAGGGGTTCGTGCTGGAGGCGAGCGCGGCGGGCGGCGCGCGCCTTAAGACGCTTACGGCCGCCGTACCGCGGGAGGCGCGCGCTCTGTTCCCCGCCCTGGCCGCACTGCTGCCCGCAGTGTTCCGCGTCGCCGGGCCGCTCCGGCCGAGACTACCCTCGCACGAAGCGCCACGGCCGAGAGTCTAG

Protein sequence:

>DPOGS208213-PA
MTATELENATKTFKFSFPTCTNEDLLFKLEVPVEIPYPGSTRELVQRILKMFHIPVYLEDELNEKLADFVSEETRNFHHNRDATLIDQLKNNELDLEGIIKNWEKQFKNVVDFAEQKGSSDEEVFAAAYHKLVHSPALETILQVESAYAKTVMDMIQNRDDDIRKLTKRQTEEMEEKIRLLNTSTTEEEINTLAAKHFEAQSLATGRWDSQLDALKHTQRAEHRTWLMNAINEYQTEEKITPSNSPLCSYASLPPAPAAPATLLEESFTIHLGSQLKQTHNIRLVCADMLDLCARDRTDGGLSLSLYSSELSGAVVVCEGRPSRSPLTSLPRVTDHHFPDLHDQLRRIEEAVADPAETRNRSRGERESRRRALRAGDVFVTRHSNLSQHVVFHLVADEDELRSAELSSRHRAVLGLREVLLAAQRNDVASVALPLLLRRELGEDATAAWCLRRAELVLKCVKGFVLEASAAGGARLKTLTAAVPREARALFPALAALLPAVFRVAGPLRPRLPSHEAPRPRV-