Monarch geneset OGS2.0

DPOGS212669
TranscriptDPOGS212669-TA1146 bp
ProteinDPOGS212669-PA381 aa
Genomic positionDPSCF300198 + 131628-134874
RNAseq coverage311x (Rank: top 36%)
Annotation
HeliconiusHMEL0179723e-10770.79% 
BombyxBGIBMGA013675-TA8e-8865.52% 
DrosophilaCG15011-PA5e-3636.96% 
EBI UniRef50UniRef50_E0VZX82e-4440.74%Nuclear transcription factor, X-box binding protein, putative n=2 Tax=Pancrustacea RepID=E0VZX8_PEDHC
NCBI RefSeqXP_002431671.13e-4540.74%nuclear transcription factor, X-box binding protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838500201e-4442.38%PREDICTED: NF-X1-type zinc finger protein NFXL1-like [Megachile rotundata]
NCBI nr blastxgi|1700538611e-5940.94%NF-X1-type zinc finger protein NFXL1 [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[260-376] IPR0189089.1e-33Uncharacterised protein family UPF0546
Orthology groupMCL12671 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212669-TA
ATGGTTACATGCTTTGGTGAGCATGAAACAGATAACCAACCGTGTCACACTGCTTCAAGGAAGCCGTGTGGAAGACAATGTGGTCGTCCATTAGCATGTGGAAACCATAAGTGTGAATTATCCTGTCATTTGTATGAGCCCAATGCAGATTATCCAAATGTACCATATACATGCAAACCATGTAACAGAGAGTGCTTGGTAGTCCGTCCACCGAAATGTACACACAAGTGTGCAAAACAGGGCTGTCACCCTGGACCTTGTCCGCCATGTAATATACTAGAAAGGATACCCTGTCATTGTGGTGTAACCGAGATATATGTGAGATGTCGTGAGTTACAGAGTGCTACAGAAGAAATGCTAAGTTGCAAACAACAATGCCCTAAGAGTCTGGAATGTGGTCATAGATGTAAAAACCTGTGCCATTCAGGTAGCTGTGGGCAGAATCAAGTATGCAACAAAAAGACTAAAATACACTGCCCATGTGGCAATTTAAAGAAAGAGGCAGCTTGCAAAGCTGTTAGGAATATGGAGGTGCAGGTCATTTGTGACGAGAGCTGTGAAGCCAAAAAAGTTGCTGCCCAATTAGAAAAAGAGAAAGAGGCGAAAAGACTCAAGGAATTAGAAGAAGAACGGAATCGTAGAGAGTTAGAAGAATACACTTGGAAATTAAGCGGCAAAAAGAAGAAATATAAAGAGAAGAAGATTGTCGTTGTTACTGATAACAGGAACTGGCTTCAGAAGTATTGGTTTCCTATTTTGTGTGTTTTGATTGGTTTGCTTGTATTAACGGGGATATTATGGGGTTGTACAAACCCATTCATAAGACAAGGCACAAAAGGTTTACGGAAAGTTTGCGCTAAAACGAAATTGGGCCAGGCTTACGCAGAGATTATTTTTCTCTTAGGGAATTGGAGGTACGTTGTACCCTGGTTGATTAACCAATGCGGTTCGTTGGTGTATTTATCGGCTGTGCAGCGTGTGCCTTTGTCTCTTGCTGTGCCTACCGCCAACAGCCTTGCGTTCGCCTTTACAGCACTAACGGGAGCAACGCTGGGTATTGAAGAGCCTTTGGATTTCGTGTCCATAATGGGAATAGTATTAATAGCTGCAGGAACCGCATTATGTTGTTGGGATAAAGTGGATTAA

Protein sequence:

>DPOGS212669-PA
MVTCFGEHETDNQPCHTASRKPCGRQCGRPLACGNHKCELSCHLYEPNADYPNVPYTCKPCNRECLVVRPPKCTHKCAKQGCHPGPCPPCNILERIPCHCGVTEIYVRCRELQSATEEMLSCKQQCPKSLECGHRCKNLCHSGSCGQNQVCNKKTKIHCPCGNLKKEAACKAVRNMEVQVICDESCEAKKVAAQLEKEKEAKRLKELEEERNRRELEEYTWKLSGKKKKYKEKKIVVVTDNRNWLQKYWFPILCVLIGLLVLTGILWGCTNPFIRQGTKGLRKVCAKTKLGQAYAEIIFLLGNWRYVVPWLINQCGSLVYLSAVQRVPLSLAVPTANSLAFAFTALTGATLGIEEPLDFVSIMGIVLIAAGTALCCWDKVD-