Monarch geneset OGS2.0

DPOGS208781
TranscriptDPOGS208781-TA1182 bp
ProteinDPOGS208781-PA393 aa
Genomic positionDPSCF300036 - 793593-795605
RNAseq coverage474x (Rank: top 26%)
Annotation
HeliconiusHMEL0154354e-15076.00% 
BombyxBGIBMGA007640-TA5e-9956.63% 
DrosophilaCG16787-PA8e-5859.43% 
EBI UniRef50UniRef50_Q7PFX45e-6657.21%AGAP000173-PA n=1 Tax=Anopheles gambiae RepID=Q7PFX4_ANOGA
NCBI RefSeqXP_001658828.12e-6564.57%hypothetical protein AaeL_AAEL008025 [Aedes aegypti]
NCBI nr blastpgi|3479633092e-6557.21%AGAP000173-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479633095e-6357.21%AGAP000173-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[198-329] IPR0187901.1e-28Protein of unknown function DUF2358
Orthology groupMCL15643 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208781-TA
ATGGCCATTTGTTTGCGCACATTTTCACATAAGATTTTGCCCATAAATGGCATCGTAACATCAACGAAACATTCAAATCGCCTAACCCGAAAAATACCCGAACGAACACAATTTCTAGAAGACTTTCTCAGTAGCAGACGCATATCTAGTTGTCAACTAGCACCGAAATATTCGTCGGAAAACTTTATTGATGATATTCACGTCCAAGAGGTTACAATTGTCAATAGTTACCTAACTCGAAACTCGAACGAAAAGTCAAAACTAACCTTCATAGACAGTACAGATGGAAGGGAAAATAATGATAATATAGAACATTATTTCAACTACTACTCAGATGGTATGAAGACATTGGGCCTCACCAGTGATGATTATGTTGAACAGAGAACATTGTATAACTTAAATGAGAGGCCTTCCATTAGCATGAATGATGTTGATTTTTCCAAATGTCATGCGGATATAGAGCCAACATGTTTATCGGACTTTGAGAGGGATGCCATTGCATTCTGTAGTGGGCCAACAGTGGCGTCTCAGGCGACACCTCCTACAGAGGAGAAAACTATTGATGGAAAGCCCAGTGAAGAACAATTGATGAAAGTCTTTCATTCACTTTCGAAAACAATGCCACAGCTGTTTGTCAAGCCCCTGGACTATTCTATCTACCATCCAAATCTTATTTTTGTTAATAACATCAGAGGGGTGACTACAGTAGGTTTATTTCACTATGTGAAACAAGTGGCGCTCCTCCGCACAGTGGCTCATATAAAATTTGCATATGTTAATTTTGAAGTATTAAAAATAACAGCACACCCAGAAGATTCTTCAGTACGTATGAGATGGAGAATCAAGGGAATATCTGGTCTCAAAGTATTCTTTATGTTCTGGAAGTACAAGTTGTGGAACCTGAAAGAGGTTTTTCAAGATCAAGAAATGTGGTATGATGGATTTTCAACCTTCTATGTTGGCACAGATGGACTTATCCAGAAGCATGTGGCAGATAAGGCAAGTAATTGTGTGATGCCAGATCAGGACAGGATAATAGATGATGAGGAGAAGGCTCCAATTGCAGCTAAAATTGCTCTTCTTATTGGTCTAATCCCTAGAAATTACCTATCAGATTTGACACCTTTCTTATCTTCATCTGACACAAATGAATGCACACCATTTTATAAAGTACTAGAATGA

Protein sequence:

>DPOGS208781-PA
MAICLRTFSHKILPINGIVTSTKHSNRLTRKIPERTQFLEDFLSSRRISSCQLAPKYSSENFIDDIHVQEVTIVNSYLTRNSNEKSKLTFIDSTDGRENNDNIEHYFNYYSDGMKTLGLTSDDYVEQRTLYNLNERPSISMNDVDFSKCHADIEPTCLSDFERDAIAFCSGPTVASQATPPTEEKTIDGKPSEEQLMKVFHSLSKTMPQLFVKPLDYSIYHPNLIFVNNIRGVTTVGLFHYVKQVALLRTVAHIKFAYVNFEVLKITAHPEDSSVRMRWRIKGISGLKVFFMFWKYKLWNLKEVFQDQEMWYDGFSTFYVGTDGLIQKHVADKASNCVMPDQDRIIDDEEKAPIAAKIALLIGLIPRNYLSDLTPFLSSSDTNECTPFYKVLE-