Monarch geneset OGS2.0

DPOGS208919
TranscriptDPOGS208919-TA1140 bp
ProteinDPOGS208919-PA379 aa
Genomic positionDPSCF300009 - 326434-329258
RNAseq coverage3671x (Rank: top 3%)
Annotation
HeliconiusHMEL0026400.090.64% 
BombyxBGIBMGA002493-TA0.090.77% 
DrosophilaCG10576-PA2e-14265.83% 
EBI UniRef50UniRef50_Q5U0Z22e-14163.54%LD30448p n=64 Tax=Eukaryota RepID=Q5U0Z2_DROME
NCBI RefSeqXP_969584.19e-15969.43%PREDICTED: similar to CG10576 CG10576-PA [Tribolium castaneum]
NCBI nr blastpgi|910933632e-15769.43%PREDICTED: similar to CG10576 CG10576-PA [Tribolium castaneum]
NCBI nr blastxgi|910933634e-15570.21%PREDICTED: similar to CG10576 CG10576-PA [Tribolium castaneum]
Group
Gene OntologyGO:00099875.4e-63cellular process
KEGG pathwaymcc:7114075e-108 
 K05084 (ERBB3, HER3)maps-> Endocytosis
    Calcium signaling pathway
    ErbB signaling pathway
InterPro domain[5-374] IPR0045451.1e-157Proliferation-associated protein 1
[307-332] IPR0009945.4e-63Peptidase M24, structural domain
[237-305] IPR0119912.1e-08Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL13270 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208919-TA
ATGGCTGACGATAAAGAAGTGGAAAAAACTGTTGCAGAAGACTTGGTTGTAACCAAGTATAAATTAGCGGGGCAAATTGTTAACCGTGTTTTAGAACAAGTGATTGCTAAATGCATACCAGAGGCGTCTGCACGTGAAATATGTGAATTTGGTGACAAATTGTTGCTTGACGAAACTTCTAAAGTCTTCAAAAAAGAGAAAGATTCCAAAAAAGGCATCGCATTTTCGACATGCGTATCTGTAAACAATTGTATATGTCACTTTTCTCCAATACCAAGTGAAACGGATTACATCCTGAAAGAGGGTGATTTAGCTAAAATTGACCTTGGAGCTCATATAGATGGCTTCATAGCAGTGGTAGCACATACAGTGGTGGTAGGCGGCGGTGAGGTCACTGGTAGAGCAGCTGATGTCTTACTAGCTGCTCACTATGCAAGTGAAGCTGCTTTGAGACTCCTCAGACCTAGCAATGAGAACTATGCTATCACTGATGTTGTTCAAAAGATAAGCGCAGAGTATGGCTGTAAGCCGATAGAGGGTATGTTATCCCATCAGCTGAAACAGTTCAGAATTGATGGAGAGAAAAGCATTATCCAGAATCCATCAGAAGCACAACGGAAGGAACATGAAAAGGCTACTTTTGAAACTTATGAAGTTTATGCTATGGATGTATTAATCTCCACTGGGGAAGGTGTTGGCCGTGAAATGGATACTAGGTGTACTATATACAAGAAAACAGATGAAATCTATCAACTTAAATTAAAATCATCTAGAACATTCTACAGAGAAGTCCGAAATAAACATGGGTCAATGCCTTTTAACTTAAGATCCTTTGACAAAGAAACAAGTGCAAGACTGGGTGTAGTTGAGTGTGTTAACCACAAACTCATTGAGCCCTTCCAGGTGCTATATGAGAGACCAGGTGAATTGGTAGCACAATTCAAGTTTACAGTACTCTTACTGCCGAGTGGCACTCATCGCATAACGGGTTTGCCATTTGACAAGAACCAGTGTAAGACTGAACGGACCATTAAGGACCCAGAACTTAATGCCCTTCTGAATTCTTCGGCGAAATCAAATAAAAAGAAGAAAAAGAAGACCGGTGCCGAGGAAGCGATGGAAGTTGAAACAGCTGCCTAG

Protein sequence:

>DPOGS208919-PA
MADDKEVEKTVAEDLVVTKYKLAGQIVNRVLEQVIAKCIPEASAREICEFGDKLLLDETSKVFKKEKDSKKGIAFSTCVSVNNCICHFSPIPSETDYILKEGDLAKIDLGAHIDGFIAVVAHTVVVGGGEVTGRAADVLLAAHYASEAALRLLRPSNENYAITDVVQKISAEYGCKPIEGMLSHQLKQFRIDGEKSIIQNPSEAQRKEHEKATFETYEVYAMDVLISTGEGVGREMDTRCTIYKKTDEIYQLKLKSSRTFYREVRNKHGSMPFNLRSFDKETSARLGVVECVNHKLIEPFQVLYERPGELVAQFKFTVLLLPSGTHRITGLPFDKNQCKTERTIKDPELNALLNSSAKSNKKKKKKTGAEEAMEVETAA-