Monarch geneset OGS2.0

DPOGS203780
TranscriptDPOGS203780-TA3279 bp
ProteinDPOGS203780-PA1092 aa
Genomic positionDPSCF300010 + 810298-833788
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0085260.088.95% 
BombyxBGIBMGA003526-TA0.078.93% 
DrosophilaCG34370-PG8e-1225.76% 
EBI UniRef50UniRef50_UPI00020624F10.043.15%UPI00020624F1 related cluster n=1 Tax=unknown RepID=UPI00020624F1
NCBI RefSeqXP_002428475.13e-13938.27%hypothetical protein Phum_PHUM383910 [Pediculus humanus corporis]
NCBI nr blastpgi|3287217210.043.15%PREDICTED: hypothetical protein LOC100572104 [Acyrthosiphon pisum]
NCBI nr blastxgi|3287217210.043.27%PREDICTED: hypothetical protein LOC100572104 [Acyrthosiphon pisum]
Group
KEGG pathway 
InterPro domain[242-352] IPR0008593.9e-15CUB
Orthology groupMCL17761 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203780-TA
ATGCGGCAAGCGGTTGCGTTGCTACTTACCGCATGTGCTGTCTCTGCACAGATTGCGTGGACACCTATATTCTTAGATGGCAGCAGTCAAGTCGATCTGTGGACGCAATCATCATCAGGTTGTGCATGCCCAGGTCAAGGAGGAGAAGAGTGCGCCTGTTGCGTGCGTGACGGAGCATGTCCCTGTGGGGATTTTGCTCCCGGAAGATGCGCTCAATGTGGACTTGAACAATTTTGTTCGAATATGTGCAATATGACTATTGACTCGCGAGTGCTCAAGAATAAGTCTGGCAAAACTTTCGGCCAGATCAAATCTCCATCCATAGAGGGTCCCAGCACATGTTCCTACATTCTTCAGCCCGATGCAGGGCAGCGAGTTGAAATTCAAGTGTATCGCCTTGTTTCTGTTGGGAGATACAACGGGTCCAGTTGTGTAGGTGGATGGTTGCAACTGAGTGGTGCCAGTGCCGTGCAGGATGCCAGTGGCGCAGCTGACACTAGATTGTGTGGTACTAACGAACGGTATACACCTCCCGTTGTATTATTTGCTGACCACGGCGCTGCTGCTCTTGATTTTAGGATAACTGAGAAGACTGAAAGATCCCAGTTCCTGGCTTTCTTCAGTTTTACATCTTTAAGCAATACACAGGGCGTTGGTTTCCATCCCAGAGGTGGCTCCAGAATTCCTAATACAGATTGTGACTGGTTGTACCAAGACGTCTCGTGTCGTGGCAGCGGTACATGTGTGTTAGCAAGTCCTGGATATCCTGGCCTGTACCCTCCACATCGTCGATGCCGATATCTCTTTGCTACAAACTCGGTACACACCAGGGTCAAAATTATATTTACATCTATATTGCTACCTAAGAACCATTGTACTACGGACTATTTAACGCTCCGGGCTGGCAGTTCGCCGTCAGCACCACTTCTCGCTTCACTCTGCTCTGAAAGAAGCGCAACTTTAGAACATCCTGGGCCAAATTTGTTACTGGAATTCAATTCTGGGCCTTTGGTATCACCATATGACTACAATGGCTTCATTGCGAAGTTAGAATTCCTTGAAAAAATAGAAGAAAATTTTCTCGCACCTACGATACCTGCTGCGCCACCACTCGCTGCGCTCACACATTTTACATATGGGGAAAGTAATGTAGCAATCGATAACGATGCAACATCAAACGAAATATCAAATAATAATGGACGTATAGGATGTGGTGCAACCGTTCGTGGTTTTGGAGCGGGCGTAATAAGATCAGGACATTTTGATACAAGAGCAACGACATGGCGAGCACAATGTATAATACATTTACTTGGAATGCCGACGGATGTGGTTCAAGTATCTCTATTTAATTATAATTTAAGGTTACAAGGTTGCAGATCTCATATTGAGGTCTTGGAAGGTGTTCATGAATATGGATTGAAAGGGGATAAAGGGGATAGATATGAAAGAGGAGACAGAGCCCTCCTAAGGATATGTGGTCCTTCTGTAAGAGAAGCAAGAGATCCTTCGGGAAGGTTTTTAGTAAGACAGGCCGTTACATCTAAGAGTTCAAACTTAACCATATTAGTGAGAAGAGCTACAACCCAAAGTGTCGACGAAGAGGAATTTGTGGATGGTGCCTTCTCATTTCATGACGAACAGCATGAGGGTACGGTTTCACCCGATGCTACATGTGCATCCACACACTACGGCCTTGCTGCGCCGGTGCATGGTGGGGTTTCAGCTCCTTCACATCATCACATTTTTTGGAATATTGAGGAGAGACTTATGTGCACTCATCGGTTTATACCAGCATCCAATCAAAGTGTTACAATCGAGATCCAAAGGTTGGAGCGTATGTGGAGCGCTGAACCAACTGGAAGTGTGGTAACCGGAACTGGTGGCTGTCGCACCGCTTGTGGTGATGCTGGTTGTGAATGTCGCGCCAACACACCACTTCGATACCACGACCACGTGGCACTTGTGGCGGGCGATGGTACACACTTGTCCTGTCTTTGTGGCGATTTCCAGGCTGCTTGGCTGCCTGTTGTAGTTAGGAGCTGGACGAGCTTGCGCTTGGAGTATTCAGTCGCCCACTACACGTACGCGAGCCGAGGATTTGATTATGCTGCTGCATATAGCTTTAATGACGATGCTATGTGTGGTCAACGCACGTACACTACACATTCTGGGGAGATTTCGTCGAAGAATGTTTCCGTTACGGGCAGTTTAAATGAATTTTTTTATCAACAATGTACTTGGGTGTTGGACTCTAATGTTGAACGACAACTTTACATTGATATATGGTCAGAACAGGATAAGTCGTGCAGCTCGTGGAATATTACACTTCACGAATGGTCGGGCATGGGGGCGCACAGTGAAGGCGGGCTGGCGGCATCGGCTGGCGATCTACTTTATACCTTCTGTGCTCGTCATAAGAACCATACATACACGCTACCCTGGAGACTCAACACAGTTGTTATTAGCTCGAGGTATGAAATGGGTTTGCTTGTTCCCGTTGGTCATAAACATGGCGTAACTGGCGGTATGTCATCTGCCGCCTCGGTTATTAGAGGCATGAGGTCGCTCGACGCGACAGCCCACAAACTTACCCTTCAAAATTCAATTACTTATTTGGCAACGTTCAAATTAGAAGCGGATGCTGCTTGGCTGCCTGTTGTAGTTAGGAGCTGGACGAGCTTGCGCTTGGAGTATTCAGTCGCCCACTACACGTACGCGAGCCGAGGATTTGATTATGCTGCTGCATATAGCTTTAATGACGATGCTATGTGTGGTCAACGCACGTACACTACACATTCTGGGGAGATTTCGTCGAAGAATGTTTCCGTTACGGGCAGTTTAAATGAATTTTTTTATCAACAATGTACTTGGGTGTTGGACTCTAATGTTGAACGACAACTTTACATTGATATATGGTCAGAACAGGATAAGTCGTGCAGCTCGTGGAATATTACACTTCACGAATGGTCGGGCATGGGGGCGCACAGTGAAGGCGGGCTGGCGGCATCGGCTGGCGATCTACTTTATACCTTCTGTGCTCGTCATAAGAACCATACATACACGCTACCCTGGAGACTCAACACAGTTGTTATTAGATTAGTTGCACTATCACGACAACAACCATTATATACAATCCGATGGCGGTCACAAGTAGTGCGGGCTAACAGGCCCAGTCCGCCCACTCCAGCCCCCGCAGCAGCAGCCATCGCCTCCAATCTCGCGCCATATAATGTTATTTTAATTTTATTGGCGCTTACACGATACGTAAACACCACCTGA

Protein sequence:

>DPOGS203780-PA
MRQAVALLLTACAVSAQIAWTPIFLDGSSQVDLWTQSSSGCACPGQGGEECACCVRDGACPCGDFAPGRCAQCGLEQFCSNMCNMTIDSRVLKNKSGKTFGQIKSPSIEGPSTCSYILQPDAGQRVEIQVYRLVSVGRYNGSSCVGGWLQLSGASAVQDASGAADTRLCGTNERYTPPVVLFADHGAAALDFRITEKTERSQFLAFFSFTSLSNTQGVGFHPRGGSRIPNTDCDWLYQDVSCRGSGTCVLASPGYPGLYPPHRRCRYLFATNSVHTRVKIIFTSILLPKNHCTTDYLTLRAGSSPSAPLLASLCSERSATLEHPGPNLLLEFNSGPLVSPYDYNGFIAKLEFLEKIEENFLAPTIPAAPPLAALTHFTYGESNVAIDNDATSNEISNNNGRIGCGATVRGFGAGVIRSGHFDTRATTWRAQCIIHLLGMPTDVVQVSLFNYNLRLQGCRSHIEVLEGVHEYGLKGDKGDRYERGDRALLRICGPSVREARDPSGRFLVRQAVTSKSSNLTILVRRATTQSVDEEEFVDGAFSFHDEQHEGTVSPDATCASTHYGLAAPVHGGVSAPSHHHIFWNIEERLMCTHRFIPASNQSVTIEIQRLERMWSAEPTGSVVTGTGGCRTACGDAGCECRANTPLRYHDHVALVAGDGTHLSCLCGDFQAAWLPVVVRSWTSLRLEYSVAHYTYASRGFDYAAAYSFNDDAMCGQRTYTTHSGEISSKNVSVTGSLNEFFYQQCTWVLDSNVERQLYIDIWSEQDKSCSSWNITLHEWSGMGAHSEGGLAASAGDLLYTFCARHKNHTYTLPWRLNTVVISSRYEMGLLVPVGHKHGVTGGMSSAASVIRGMRSLDATAHKLTLQNSITYLATFKLEADAAWLPVVVRSWTSLRLEYSVAHYTYASRGFDYAAAYSFNDDAMCGQRTYTTHSGEISSKNVSVTGSLNEFFYQQCTWVLDSNVERQLYIDIWSEQDKSCSSWNITLHEWSGMGAHSEGGLAASAGDLLYTFCARHKNHTYTLPWRLNTVVIRLVALSRQQPLYTIRWRSQVVRANRPSPPTPAPAAAAIASNLAPYNVILILLALTRYVNTT-