Monarch geneset OGS2.0

DPOGS213114
TranscriptDPOGS213114-TA2328 bp
ProteinDPOGS213114-PA775 aa
Genomic positionDPSCF300016 + 288271-295049
RNAseq coverage575x (Rank: top 22%)
Annotation
HeliconiusHMEL0097880.055.05% 
BombyxBGIBMGA007876-TA0.057.70% 
DrosophilaCG11880-PA1e-15941.41% 
EBI UniRef50UniRef50_D6WC460.043.21%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WC46_TRICA
NCBI RefSeqXP_002000902.12e-17541.85%GI22273 [Drosophila mojavensis]
NCBI nr blastpgi|2700023122e-18043.21%hypothetical protein TcasGA2_TC001323 [Tribolium castaneum]
NCBI nr blastxgi|1582900260.042.82%AGAP010343-PA [Anopheles gambiae str. PEST]
Group
KEGG pathwayspu:5930449e-82 
 K12819 (SLU7)maps-> Spliceosome
InterPro domain[11-773] IPR0076031.3e-255Choline transporter-like
Orthology groupMCL10290 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213114-TA
ATGGGTTGCTGTGATAATTGCTGTGTGCCACCGAAAGAATCAAGAGAGCCTATAAAATATAATCCGAACTTCAACGGCCCGACCCGCAACCGATCATGCACGGATGTATTATTTTTGATTATGTTCATCCTTTTTCTTGGAGGCTGGGGCTTCGTCGGCTATTACAGTATAACACGAGGGAGCGTTGAGAAACTTTTAGCACCTATAGACTCGAAAGGCCGCCGTTGTGGTGTGGATTCCGGCCTTGAGGATAGGAAATATCTTGTATTCTTCAACATCGCAAAATGTCTGTCGCCCGGCACTCCTATAACTGGTTGTCCCACTGCACAAGTATGTGTGCAAAAGTGTCCTGAGAGAACTTTTATATTTGAAAAAGAGTTAAGACAAAATCCAGCTTCGTTTGAACAATTAAGGCAGAATATGGTGTGCACCGATGAAGTCACTAATATACAAACTATGACGTTATCGGAGGCTATCGCGTACATGCAACAAGAAAAGTGCGCCAGTTTTGTGTTACAGAGTCAACCAGTGTTGTACCGTTGCATCGGAGACCTCTCAGCGCTATCCTGCGAGTCACCGGCGGCCAGCTCTTCCGAGGTCTGTGTGCGAAACCCGAAGGAAGCCCAGAAGTCCTTGATCGAAGTCTTGAATGCCATCGATTCTTACGTTGGCTGGTTCACCGCGAGATGGGTCACGTTTTTCACGAGGAACGAGAAAGAAGCTCACATTGTAGTGTTGGGTCGCTGCTTTGTTAACGTCACCGCGGCTGTTGAAGTCTTGCGAGAGGTTTCAGACCTCGATCTGGGAGAAGGACAGGTGACCGAACAATTCACCAAGTTGCTTAAATTCAATCAGTTGTCGTCGCAGATAGTCCAGGATCTGCAGCAGTCCCGCTGGTACTTAGCCGGTGCTCTGATCGGCATCGTGCTGCTATGTTTTATATACATACTGCTGTTACGATGGCTGGTCGCGCCCGTTGTATGGGTCTCGATCGCAGGCCTTTTCGCTCTCCTTGGATTCGCCATTTATCTTTGTTACAAGAACTACATTTACTATAAAGAAAACGTTGGCTTGTACCAGGAAACAAATTTGAAGGGTTACGTGGAGTCAATATTCGGGAAATACCAAACATGGCTAGCTCTGGTCATTATATTGGCAGCTATTTTTATTATTTTATTACCGGTTATCATATTTTTGAGAAGTCGAATAACGATAGCGATAGCTCTCATACGAGAAGGCAGCAAAGCAGTAACAGCTAATAAGACGACGATAGTGTTTCCGATATTCCCTTGGATATTCCAATGCGCTATAATCGGTTACGGCGTGCTGGTCCTCATGCACTTGTTGTCTATCGGACAATCGGCCTTCCAGGTCGTTAACAAGAGAATCGATACCAATTGCGACTGCGGCGGCTTGTACAAAGAGGACGGCGTGTCGTGTGATCCAGTGAGGTTCGCGGCTAGTTGTCACGACACGTCCAACGCTCTGCTGCCGTGCGCTAACGCGACCTGTCACTACACGGGCGTCGACAGTCCACACTACATCATCTACTTGCACCTCGTAAATTTACTAGGATTTTTCTGGGCAATGTTCTTCATAAGCGGTGTGGCAGATATGACCCTAGCAAACACATTCTCTACGTGGTATTGGACTTACAACAAAAGAAATCTTCCTTTTTTCACTCTAACGTCATCAATTTATACAACAATTCGGTATCATTTAGGGACAGTTGCTTTCGGTGCACTTATAATTGCAATAGTCCGAGTTATCAGAGTCATTTTGGAATACATAGACCATAAAGTGAAAAAGTTTGACAACGCTTTTACGAGAGCTATACTGTGTTGTTGTCGGTGCTTCTTCTGGTGTCTGGAGAATTTCTTGAAGTTCGTGAATAAGAATGCTTACATCATGTGCGCTATTCACGGAAAGAATTTTTGTAAGAGCGCCAGTGACGCATTCTCGTTGCTAATGAGGAATGTGGTCCGACTCGTCGTATTGGACAAGATGGCGGATTGGATATTCTTTCTGTCCAAGCTGTGCATCTCCTTGGGCGTGGGTCTGGGCGTGTTCTATTTACTGGAGTGGGGGGCGCTGTACGAGGCGGGCGCCGCTCGTCTTCATCACCACCACGTGCCAGCAGCACTCCTCGCCATCGCCACCTACCTCATATGTACAATCTTCTTCAATGTATACTCCACCGCTGTGGACACGCTGTTCTTATGTTTCCTGGAAGATTGCGAAAGAAATGACGGTTCTGAAGAGAAACCATACTTCATGTCGAAAAATCTTATGAGGATTCTTGGGAAGAGGAACAATATTGTCAAGTAA

Protein sequence:

>DPOGS213114-PA
MGCCDNCCVPPKESREPIKYNPNFNGPTRNRSCTDVLFLIMFILFLGGWGFVGYYSITRGSVEKLLAPIDSKGRRCGVDSGLEDRKYLVFFNIAKCLSPGTPITGCPTAQVCVQKCPERTFIFEKELRQNPASFEQLRQNMVCTDEVTNIQTMTLSEAIAYMQQEKCASFVLQSQPVLYRCIGDLSALSCESPAASSSEVCVRNPKEAQKSLIEVLNAIDSYVGWFTARWVTFFTRNEKEAHIVVLGRCFVNVTAAVEVLREVSDLDLGEGQVTEQFTKLLKFNQLSSQIVQDLQQSRWYLAGALIGIVLLCFIYILLLRWLVAPVVWVSIAGLFALLGFAIYLCYKNYIYYKENVGLYQETNLKGYVESIFGKYQTWLALVIILAAIFIILLPVIIFLRSRITIAIALIREGSKAVTANKTTIVFPIFPWIFQCAIIGYGVLVLMHLLSIGQSAFQVVNKRIDTNCDCGGLYKEDGVSCDPVRFAASCHDTSNALLPCANATCHYTGVDSPHYIIYLHLVNLLGFFWAMFFISGVADMTLANTFSTWYWTYNKRNLPFFTLTSSIYTTIRYHLGTVAFGALIIAIVRVIRVILEYIDHKVKKFDNAFTRAILCCCRCFFWCLENFLKFVNKNAYIMCAIHGKNFCKSASDAFSLLMRNVVRLVVLDKMADWIFFLSKLCISLGVGLGVFYLLEWGALYEAGAARLHHHHVPAALLAIATYLICTIFFNVYSTAVDTLFLCFLEDCERNDGSEEKPYFMSKNLMRILGKRNNIVK-