Monarch geneset OGS2.0

DPOGS208878
TranscriptDPOGS208878-TA1191 bp
ProteinDPOGS208878-PA396 aa
Genomic positionDPSCF300009 - 1320607-1322367
RNAseq coverage168x (Rank: top 51%)
Annotation
HeliconiusHMEL0039000.075.32% 
BombyxBGIBMGA004637-TA0.074.69% 
DrosophilaCG12320-PA4e-6533.25% 
EBI UniRef50UniRef50_E2BZD72e-10647.98%Uncharacterized protein C20orf4-like protein n=3 Tax=Endopterygota RepID=E2BZD7_HARSA
NCBI RefSeqXP_972058.13e-10747.57%PREDICTED: similar to Uncharacterized protein C20orf4 homolog [Tribolium castaneum]
NCBI nr blastpgi|910771705e-10647.57%PREDICTED: similar to Uncharacterized protein C20orf4 homolog [Tribolium castaneum]
NCBI nr blastxgi|3071978361e-10247.98%Uncharacterized protein C20orf4-like protein [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[1-397] IPR0079462e-119A1 cistron-splicing factor, AAR2
Orthology groupMCL12662 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208878-TA
ATGGATCAAGAAACTGCTAAAAAGCTGTTAGTGGAAGGAGGAACTTTTATATTTCTCGGAGTGCCCCAAGAGACTCAATTTGGAATTGATATGCAATGTTGGAATACCGATGAGGATTTTCGTGGAATAAAAATGATTCCGCCGGGCCTTCACTATGTTCATTATGCTGCAGTGAATAAAGACACAGGCGATGTTTCTCCTAGGTCAGGGTTTATGCACTATTTTGATAAAAAAGAATTTTTGGTAAAAATGTGGGATAAACATTTGGAAGATATTAGTAGAGAAGAAATAAGTGAAGAAAGTATTCAGCGTTTAAGGGAGAATTTACTTAATATAGATAAACATTTGGCCCCATACCCATATGAAATATGGCAGAAATGGAAACTTTTATCTTCACAAATCAATGCTGATCTAGCAAAAAAACTATCCCCTGAAACTGGCTTGATAAGATCATCAGTGGAACTGCTCTCAACAAGTGACGCTGATAGACCTCGAGGTGTTAAAGTAACTGAAAATTCAGAGGTTTCAACCATCACTGAAAATAATGATGAAATTTCAAATCCCAGTCAATCAGGTCTCAAAAGGACGAGAAGATCCACACAACAAGAAAAAGAAGAAGCTATGTTGCCAAACTTAAAACCTGCACCTGGAATGTCAATGAGATTTACAGAAATACCCAAAGACAAATACCCACCTGGTTCAGCGCCGGAAGAAATAACTAAACATTATTTGGACCAGTCTTACACATTGGAACTTATGATTAGAGCACACGATGAGCCTCTTTATATAATAGGTGAAATGCAATTTGCATTTTTGTGTTTTCTCATCGGCCATTCCCTTGAAGCATTTGAACATTGGAAAAGTATGGTAATGTTGTTCTGCTCTTGTGAAGATGCAATTCACAAATATAGAAGTGTATATTTTCACTTCATTAAAACTATTGAAATTCAAATTGATGAGATGCCCGAAGAATTTCTGGCAGATATTGTCATGAACAAAAACTTGGTGTATAAAAAATTACGAGAACTATTCCGAACTGCGTACATGAGTAAAGTTGATGGTCGTCTACTCACATTGATTGAAAGCCTCAAAGAAAATTTGTCACAGAAGTTACAATGGGACTTTACAGGATTAGATTCCGATGAAGATGATGAAAGACCAGTGGTTGTAAAATTAAATGACACAGATTAA

Protein sequence:

>DPOGS208878-PA
MDQETAKKLLVEGGTFIFLGVPQETQFGIDMQCWNTDEDFRGIKMIPPGLHYVHYAAVNKDTGDVSPRSGFMHYFDKKEFLVKMWDKHLEDISREEISEESIQRLRENLLNIDKHLAPYPYEIWQKWKLLSSQINADLAKKLSPETGLIRSSVELLSTSDADRPRGVKVTENSEVSTITENNDEISNPSQSGLKRTRRSTQQEKEEAMLPNLKPAPGMSMRFTEIPKDKYPPGSAPEEITKHYLDQSYTLELMIRAHDEPLYIIGEMQFAFLCFLIGHSLEAFEHWKSMVMLFCSCEDAIHKYRSVYFHFIKTIEIQIDEMPEEFLADIVMNKNLVYKKLRELFRTAYMSKVDGRLLTLIESLKENLSQKLQWDFTGLDSDEDDERPVVVKLNDTD-