Monarch geneset OGS2.0

DPOGS203961
Transcript: DPOGS203961-TA, 1212 bp
Protein: DPOGS203961-PA, 403 aa
Genomic position: DPSCF300005 + 443565-447601
RNAseq coverage: 265x (Rank: top 40%)
Annotation
Heliconius: HMEL013513 | 0.0 | 84.77%
Bombyx: BGIBMGA002093-TA | 4e-88 | 66.41%
Drosophila: CG14971-PA | 9e-93 | 43.65%
EBI UniRef50: UniRef50_Q7PNQ1 | 1e-106 | 62.06% | AGAP005537-PA n=6 Tax=Culicidae RepID=Q7PNQ1_ANOGA
NCBI RefSeq: XP_001648026.1 | 2e-108 | 51.95% | solute carrier family 35 member C2, putative [Aedes aegypti]
NCBI nr blastp: gi|157103545 | 3e-107 | 51.95% | solute carrier family 35 member C2, putative [Aedes aegypti]
NCBI nr blastx: gi|158294336 | 1e-105 | 62.06% | AGAP005537-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway
InterPro domain: [187-330] IPR004853 | 1.5e-27 | Domain of unknown function DUF250
Orthology group: MCL12143 | Single-copy universal gene

Nucleotide sequence:

>DPOGS203961-TA
ATGCCAGGCGCAAAATATGAACAGCTGTCGTTGAATGCCGACGCCGATGAAGATGTTTTATCGAATAAAACTAAGGCAAAATGGTCTGACGTGTGTTTCCAAAAAAGTCTTCTGTCTCTAGGATTAATACTTCTATATTTTTCTCTGTCTATCGGATTGACCTTCTATCAAAGATGGCTTTTGAAGGATTTTCACTATCCTTTGACAGTGGTGATGTACCACCTAATTGTCAAATGGGTGCTCTCAGTTTTTGTCAGGATGATTATGCGCTTGATAACTGGTATGCCTCAACTCTTGTTGCCATTTATGACCTGTTTGAAATCGGTTGGCCCTACGGGTTTGGCTAGCGGCATTGATGTTGGATTCTCTAATTGGGGCCTTGAACTTGTAACCATATCTCTGTACACCATGACTAAATCTACCACTATCATATTTATTCTCGGATTTGCTATTCTTCTGGGATTAGAAAAGAAGTCATGGTCATTAGTCGGCATTGTCCTGATGATAGCAGCTGGTCTGATTATGTTCACATATAAAGCAACCCAGTTCAATTTGTTGGGTTTTAATTTCCTACTTTTGGCTTCATTCGCTGCGGGGCTAAGATGGACTTTTGCTCAGTTGTTAATGCAGAAATCGAAACTTGGACTCCACAACCCTGTGGATATGGTGTTCCATGTTCAACCCTGGATGTTCCTATCCTTGCTTCCGTTTACTATTATGTTTGAAGGTATGAACTGCCTTCAGTACATGTACGAGTTGCCACCATCGGAACTGTTGCCATCAGTGCTGAAGGTATCTGTGGGAGCCACTATAGCCTTCGCCATGGAGATCAGCGAATTCCTCGTGGTCACATACACGTCTAGTCTTACTTTGTCTATAGCGGGAATATTTAAGGAAATGTGCATTCTAGTCCTAGCAGTTGAAGTGAGTGGAGATCTCCTTAGTCCCATCAATGTTGTGGGGCTGGCTGTGTGTTTGTTGGGTATAAGCGGCCACATAATCCACAAAATATTGGTTATTAAATCGGTTACCGGTTCAGTGCGGGCCATTCACTACAATAATATGAGAAGTCGGCTTGAAAAGTCCAAAGAAGACCATGGTGAACCTCTGTTAGTAGATGATAACAAATGGCAGAATGTTGCAAGCGAAGAAAGTGACATAGACTCCAATGTTGTCATATATGAAGTACTACAGAGAAGGGATGGTAGATAG

Protein sequence:

>DPOGS203961-PA
MPGAKYEQLSLNADADEDVLSNKTKAKWSDVCFQKSLLSLGLILLYFSLSIGLTFYQRWLLKDFHYPLTVVMYHLIVKWVLSVFVRMIMRLITGMPQLLLPFMTCLKSVGPTGLASGIDVGFSNWGLELVTISLYTMTKSTTIIFILGFAILLGLEKKSWSLVGIVLMIAAGLIMFTYKATQFNLLGFNFLLLASFAAGLRWTFAQLLMQKSKLGLHNPVDMVFHVQPWMFLSLLPFTIMFEGMNCLQYMYELPPSELLPSVLKVSVGATIAFAMEISEFLVVTYTSSLTLSIAGIFKEMCILVLAVEVSGDLLSPINVVGLAVCLLGISGHIIHKILVIKSVTGSVRAIHYNNMRSRLEKSKEDHGEPLLVDDNKWQNVASEESDIDSNVVIYEVLQRRDGR-
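
The lengths and coordinates reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. The function names are hypothetical, and it assumes the transcript is the coding sequence plus one stop codon and that genomic coordinates are 1-based and inclusive.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals the protein length
    plus one stop codon, at three bases per codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Span in bp between two coordinates.

    Assumes 1-based inclusive coordinates (an assumption, not stated
    in the record)."""
    return end - start + 1


# Values taken from the record above.
print(cds_length_matches(1212, 403))   # True: (403 + 1) * 3 = 1212
print(genomic_span(443565, 447601))    # 4037
```

Under these assumptions the 1212 bp transcript matches the 403 aa protein plus a stop codon (the trailing "-" in the protein sequence), and the genomic span of 4037 bp is longer than the transcript, which would be consistent with the gene containing introns.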