Monarch geneset OGS2.0

DPOGS211024
TranscriptDPOGS211024-TA1266 bp
ProteinDPOGS211024-PA421 aa
Genomic positionDPSCF300004 + 1534670-1548371
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0221556e-13787.05% 
BombyxBGIBMGA006504-TA1e-10588.02% 
DrosophilaCG31033-PC7e-7836.67% 
EBI UniRef50UniRef50_Q86BR61e-7536.67%CG31033, isoform C n=31 Tax=Coelomata RepID=Q86BR6_DROME
NCBI RefSeqXP_968273.19e-10956.42%PREDICTED: similar to AGAP002315-PA [Tribolium castaneum]
NCBI nr blastpgi|910794842e-10756.42%PREDICTED: similar to AGAP002315-PA [Tribolium castaneum]
NCBI nr blastxgi|910794843e-10556.42%PREDICTED: similar to AGAP002315-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055153.1e-18protein binding
KEGG pathwaycin:1001857422e-07 
 K04539 (GNB5)maps-> Chemokine signaling pathway
InterPro domain[6-200] IPR0139231.4e-39Autophagy-related protein 16
[257-365] IPR0159433.1e-18WD40/YVTN repeat-like-containing domain
[272-309] IPR0197812.1e-11WD40 repeat, subgroup
[270-309] IPR0016804.8e-11WD40 repeat
Orthology groupMCL15827 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211024-TA
ATGGAAGGCGGTGAGTGGAGAAAAGACATAATAAATCAATTGCAATCACGAAACAAACGTGAAACCAGCACATTTCAGGACATCATAGCGTTTCAAAGCAGACTCTTTGACAATGTTAGTACCCTGAAAAATGAAAATCTACAGTTGACATTGATGAATGAGAGAATGCGATATTCTAACAATGAGAGTGTTTCAAGTGGAGGAAATCCTTCATTTGAGAAAATACAAGCTATGGAACAAAGGATTCTAGCTCAACAGGAAGAATTGACTTCATTGCACAGAACAAGAGGTGAAAATGCACAGGAGATAATAAACCTTAATGCGAGGGTGCGGGAATTGGAAAAAAGTTTACAATCAAAGGATATACTAATTTCAGAAAATATGGCACTAATAGCTTCACTACGAGCTGAAATACAGATGTATGATACAAATATGAATGAACTACAAGGCTTGAATCAAATGTTGAGAGACGAACATCAAGCACTGCAGATTGCATTTGCTTCGATAGAGGAAAAATTGAGGAAAGCTCAGGATGAAAACCGTTCGCTCGTCGAGAGACTTATTAAGTACAAAGCCAAAGATGCCGATAAAATGAATGAAGAGAACGAACATTTCCTAAAGTCAAACAACCCTACTGCGTTTTTCATCAACACGTTTGGTAGAGTTAGTTTCGGAAAAAAAAGTGACAAAGTTCGCAAAGAGTTAGAAGAAGCGGCTAGAGAAGGTTGCAGTCGGGGTAGTGACGGTTCGGGCGGTAGCGTTGATGACAAGATAATGGATTCAATGCCATACTACGCAACCACTTTACCCACTAAAGTTTTATCAAGTTTTGACGCTCACGACGGCGAGGTGAATGCCGTCAAATGGAGTCCTACTGACCGCTTAGTGGCGACCGGTGGAGCGGATAGGAAAGTTAAATTATGGGATGTCTCTAAAATGGGTCTGGTAGAAAACAAGGGTGCGTTGGTGGGTTCGAATGCTGGTGTGATGTCTGTTGACTTTGATGCCACCGGCGTCTACATAGTTGGGGCCTCGAACGACTTCGCGAGTCGCGTGTGGACTGTCGGGGACCAACGGCTAAGGCGCGAGTTAGATTCCAAGCGCCGTTTTGCTAATGGTTGTTGTCAGATCGGCACGTGTCTGTGGTCGCGTGCCGCTGTCGCTTGCACACAACGATCTAATCTCGGGCACGAATGCCGACGTGAGGTGTGCCCCTTCACACGTGATTGTTCCACGCAAACCAACTTAGAGATCACATCTACTTGA

Protein sequence:

>DPOGS211024-PA
MEGGEWRKDIINQLQSRNKRETSTFQDIIAFQSRLFDNVSTLKNENLQLTLMNERMRYSNNESVSSGGNPSFEKIQAMEQRILAQQEELTSLHRTRGENAQEIINLNARVRELEKSLQSKDILISENMALIASLRAEIQMYDTNMNELQGLNQMLRDEHQALQIAFASIEEKLRKAQDENRSLVERLIKYKAKDADKMNEENEHFLKSNNPTAFFINTFGRVSFGKKSDKVRKELEEAAREGCSRGSDGSGGSVDDKIMDSMPYYATTLPTKVLSSFDAHDGEVNAVKWSPTDRLVATGGADRKVKLWDVSKMGLVENKGALVGSNAGVMSVDFDATGVYIVGASNDFASRVWTVGDQRLRRELDSKRRFANGCCQIGTCLWSRAAVACTQRSNLGHECRREVCPFTRDCSTQTNLEITST-