Monarch geneset OGS2.0

DPOGS206048
Transcript: DPOGS206048-TA  1185 bp
Protein: DPOGS206048-PA  394 aa
Genomic position: DPSCF300028 - 1100406-1104201
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius: HMEL002818  4e-163  75.68%
Bombyx: BGIBMGA000500-TA  4e-126  82.33%
Drosophila: CG13313-PA  8e-93  46.83%
EBI UniRef50: UniRef50_Q9VSR6  1e-90  46.83%  CG13313 n=7 Tax=Drosophila RepID=Q9VSR6_DROME
NCBI RefSeq: XP_623770.1  1e-96  49.57%  PREDICTED: similar to CG13313-PA [Apis mellifera]
NCBI nr blastp: gi|307189401  9e-100  49.72%  Intraflagellar transport protein 122-like protein [Camponotus floridanus]
NCBI nr blastx: gi|307189401  2e-100  49.44%  Intraflagellar transport protein 122-like protein [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain: [100-213] IPR000859  1.3e-13  CUB
Orthology group: MCL12144  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206048-TA
ATGACAAGCAAACTATTTTGGTTTATTTTCATAACATCAGTTATCGCTATTGCAGCGTTTACTGTTTTGGAAGACAATTTCGAAGATGAAGACAACTATGACGTCGATACTCATGGAAGAAGAGAAGGCAAAATTTTATTCCCGTTCGTCAGTATCGTCCGTTTCGCCAACGTGGAATGTTCTAGTTCGAGTACAATGAACGGCATCTGTCTTGCTCGTCGCGAATGCAATAATCTTAATGGCACCATCACAGGCACTTGTGCCTCTAGAAGAGGCAGATGCTGCATTGTATCAAGAAGTTGTGGAGCAACCACAAACGTAAACAACACTTACTTCACTAGTCCGGGATATCCTTCGGCTTACGCTGGCGGACAATCGTGCAGCATCACAGTCAATCGCTGTAATAGCAATATTTGTCAGCTTAGAATTGACTTCTTGGATTTGGTTTTGGCGCAACCCGATGGCGATGGCATCTGCAACACCGATTTCATAAGTATAACTGGAGGGAATACAGTTGTCCCTCTTCTTTGCGGTGACAATACTGGTCAGACTTTATTCGTTGATTTCAACGGAAACACCGCTATAACAATAACCGTGACAGCAACCCTTGCCACCACGTTTTCGAGGCGATGGAACATACTGTTAACTCAACTAGGATGCGATTGTCCTGGAATTGCGCCAAATGGATGCCTGCAATACTACACGGGTACTACTGGGACCATAAACAGCTTCAATTACGGCACCGCAGCTAACACTGCTCTGAGTGCGTCACTCGTTACGGGAACTAGACAAATCGCCAATTTAAACTATGGTATATGCATCAGAATGGAGGCCGGTTTTTGTGGAATACAATATGCACAATCTGCAAGTAGCGTTTTTTCTTTCACTGTGACGGGGGACGTGGAAGGCGCTGATAACACAGTGTTGGGGACGCCAATAGGAGCTGTGAACGGGGTCGCGTGCACAACTGATTTCGTCGTTATACCCAATCCTGTGACTACTGCTACCGGTGTACCAGTCAACACTGATCGCTTTTGTGGTCTTGGTTTCGTGCCCGTGCAAACTAACGCTAAACCGTTCGTGTTATACGTTGTAACGAATGGTAATGAGGGTGTGACTGCAACGACGCCGCCAGATGTCGCCAACAGAGGTTTCTCTCTCATGTACACTCAAGTCGCCTGTTAG

Protein sequence:

>DPOGS206048-PA
MTSKLFWFIFITSVIAIAAFTVLEDNFEDEDNYDVDTHGRREGKILFPFVSIVRFANVECSSSSTMNGICLARRECNNLNGTITGTCASRRGRCCIVSRSCGATTNVNNTYFTSPGYPSAYAGGQSCSITVNRCNSNICQLRIDFLDLVLAQPDGDGICNTDFISITGGNTVVPLLCGDNTGQTLFVDFNGNTAITITVTATLATTFSRRWNILLTQLGCDCPGIAPNGCLQYYTGTTGTINSFNYGTAANTALSASLVTGTRQIANLNYGICIRMEAGFCGIQYAQSASSVFSFTVTGDVEGADNTVLGTPIGAVNGVACTTDFVVIPNPVTTATGVPVNTDRFCGLGFVPVQTNAKPFVLYVVTNGNEGVTATTPPDVANRGFSLMYTQVAC-
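
Consistency check:

The transcript and protein lengths listed above agree with each other: 1185 bp is 395 codons, which is 394 amino acids plus one stop codon. The sketch below shows one way to check this and to translate fragments of the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop codon as "-". The function name `translate` and the short fragments are illustrative only and are not part of the geneset tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... so they line up with the amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one codon at a time.

    Stop codons are written as `stop_symbol` ("-" matches this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(transcript_bp):
    """Amino acid count for a CDS that ends in a single stop codon."""
    return transcript_bp // 3 - 1


if __name__ == "__main__":
    # First 15 and last 12 nucleotides of DPOGS206048-TA.
    print(translate("ATGACAAGCAAACTA"))   # MTSKL
    print(translate("GTCGCCTGTTAG"))      # VAC-
    print(expected_protein_length(1185))  # 394
```

To check the whole record, pass the full DPOGS206048-TA sequence to `translate` and compare the result with the DPOGS206048-PA sequence.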