Monarch geneset OGS2.0

DPOGS210655
TranscriptDPOGS210655-TA1386 bp
ProteinDPOGS210655-PA461 aa
Genomic positionDPSCF300401 + 123266-130557
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0107913e-3577.34% 
BombyxBGIBMGA001643-TA2e-10253.53% 
DrosophilaCG13510-PA9e-0942.25% 
EBI UniRef50UniRef50_E0VEE21e-1144.78%Lipopolysaccharide-induced transcription factor regulating tumor necrosis factor alpha, putative n=1 Tax=Pediculus humanus corporis RepID=E0VEE2_PEDHC
NCBI RefSeqXP_002424486.13e-1244.78%lipopolysaccharide-induced transcription factor regulating tumor necrosis factor alpha, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420073145e-1144.78%lipopolysaccharide-induced transcription factor regulating tumor necrosis factor alpha, putative [Pediculus humanus corporis]
NCBI nr blastxgi|1571157804e-1437.74%hypothetical protein AaeL_AAEL007344 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[207-275] IPR0066293.6e-18LPS-induced tumor necrosis factor alpha factor
Orthology groupMCL44322 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210655-TA
ATGGATAATGAAAATGATAACCACAATATAGACAATAATCCAACGAATAGAGATGTACAAACGGAGGAAGATAACGATGATATATCGATTACATCTAAAGAAATCATGACTTATAAGTTAGATGTAAGCCCAAATGGCGTCGACCCCGTTTATAATAACTGTTCTCCGAGATTCTCAAGAGAGGGACCAAGTGCTCTCCCAGAATCTCCAGTCGTGTTTACCCAACCTCTGCGCATATTTTCGAACGTACCATCACCAATGTCGTTAAACATGCCGACGTCCTCGCAGCCCCAATCTTTATTCTTGCAGGCTTTAGCCAAATTAAACACGAGCTCTATAAGTTTAGCATTCAAAAATATCGACGACAAACTAAGATTCAATAACGATTCAGCTCTTAATAAAGAACTATCACCGCCCGATTCTTTGACTAGCGCGCCGCCGTCCTATTCATTTGTACTGCGGCAAATGGCTGCAAGGAGAAGGCCTAGATTGATGGGAACTTTCTATCCATCACCGTCCTTCGTGCAGCACACACCTCCTCCGAACTACGCTACGGCTTTCGATATCTACGTTGATAATCCCATTACGCAACCACCACCTAGGATATATAATTTTGGATTTACGCCAATGCCTATCGTCTGCCCTCAATGTGGACACACGGGGATGACAGTGGTTACCTGTAAAATAACGTTGTGTACCCATTTATGTGCAATGTCGTTATGTTTGATGTGTTGTTGGATATGTGCGCCCCTGCCATACGTCCTACGATCATGCAAAGATGTTTATCACTACTGCAGGAATTGTCGCAGTTATCTTGGAATGTACTGCCCCACCAGTCCAGATAGGGCATGGCGCCGCGCTCGTGATCCTAGGTCGGCGCGAAGCGCGCGCCCGGGCTTACATAACGTGGATAGCAGCGGCATGGCCGCGCGGGTGTCCGTCGAGGGCGCAGGACCCTCGTACGCGTCGGTGCTCAACTTTAGAGTGAGTGACAGCAATAAAGAAAACATAGAGGCCACGGAGGAGGTGGCAGACGCGCCACCTCGCCAGGATGATGAGGAGGAGGAAGGTTTCGTGCCAGTCATCTCGCACGCTCGTAGACCGGCCAAGAGCCGCAGGGAGCGAGAGCGCAGAGCTGCACCTCGTCCGCACAAGGAACCCAAGCCGCAAACGGAGCCCCAGCCAGCGTCTGACCAACAACAGTCAACAGAACCGCAGCCCAAGAAGTTCGTGGAAGCTCCCATACCCAAAGTAAACCCTTGGCAGGTGGAGATGACCTCGGCGGTCACCAGAAAATATATGACGAGCGACTTCAAGACGCACCCGAAAAACCCTCAGGCAGTAGAAGCCCTATCCATCAGTTATTCCGACTTCCAAATCAGCTGA

Protein sequence:

>DPOGS210655-PA
MDNENDNHNIDNNPTNRDVQTEEDNDDISITSKEIMTYKLDVSPNGVDPVYNNCSPRFSREGPSALPESPVVFTQPLRIFSNVPSPMSLNMPTSSQPQSLFLQALAKLNTSSISLAFKNIDDKLRFNNDSALNKELSPPDSLTSAPPSYSFVLRQMAARRRPRLMGTFYPSPSFVQHTPPPNYATAFDIYVDNPITQPPPRIYNFGFTPMPIVCPQCGHTGMTVVTCKITLCTHLCAMSLCLMCCWICAPLPYVLRSCKDVYHYCRNCRSYLGMYCPTSPDRAWRRARDPRSARSARPGLHNVDSSGMAARVSVEGAGPSYASVLNFRVSDSNKENIEATEEVADAPPRQDDEEEEGFVPVISHARRPAKSRRERERRAAPRPHKEPKPQTEPQPASDQQQSTEPQPKKFVEAPIPKVNPWQVEMTSAVTRKYMTSDFKTHPKNPQAVEALSISYSDFQIS-