Monarch geneset OGS2.0

DPOGS210089
Transcript: DPOGS210089-TA (2103 bp)
Protein: DPOGS210089-PA (700 aa)
Genomic position: DPSCF300017 + 523826-531047
RNAseq coverage: 517x (Rank: top 24%)
Annotation
Heliconius       HMEL015480          2e-155  70.07%
Bombyx           BGIBMGA012670-TA    8e-144  59.54%
Drosophila       CG31694-PA          2e-64   37.12%
EBI UniRef50     UniRef50_E2AMV6     9e-63   36.34%  Interferon-related developmental regulator 1 n=8 Tax=Neoptera RepID=E2AMV6_CAMFO
NCBI RefSeq      XP_002066947.1      1e-65   38.48%  GK24748 [Drosophila willistoni]
NCBI nr blastp   gi|289739863        1e-65   39.31%  interferon-related protein PC4-like protein [Glossina morsitans morsitans]
NCBI nr blastx   gi|289739863        3e-64   39.41%  interferon-related protein PC4-like protein [Glossina morsitans morsitans]
Group
KEGG pathway 
InterPro domain: [315-599]  IPR007701  1.3e-63  Interferon-related developmental regulator, N-terminal
                 [663-696]  IPR006921  2.9e-10  Interferon-related developmental regulator, C-terminal
Orthology group: MCL14872 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210089-TA
ATGCTGGTTACAGCGCTGTTCCTACTGACGAGCCTGGCACAAGCACGACCGTTCAACATCACACACCTCATAGTGCCGGAAGTCATAGCTCCGGGACAGGAAGAAGTTGAGATCGAGTGCAGATATGACGCTAACTTTACATTACTCAACTGGTTCAAAGGGCCTAACGAATTCTTCAGATACAGACCGGGCGCAGCTCCCAGCACGAGATCATTCCCAGTTCTTGGAGTCGGGAGAGTCGAGCTTATAATTTGTGGACCGACGGCGTGCAGGTTGAAACTCGGCTCACTCACAGAGGAGGCGACAGGGTTGTACAGGTGTGACATCGAGAGGGATGTGCCACCTTATAAATTTGCTACTCGGACGGCCTATATGGAAGTTCACGGACATGAGCACAGAAAGCCATTACTCGAGGGGCTCGATGAGGAGTTCGGCGAGGGAGACGACATGCAGGCGTACTGTCGAGGAGATCCAGAGACGGAGATACGGTGGTATATAAATGGAAGAGAGTTAGAGGAGATGCGAGGAGCCAGCTCCTTGAAGAGGAAAAGCTCCCGCTTGATCTTCCTAGGAATCCCTCCCATGGTTACGGTGCAATGTGCAGAATACAAATTTGGGAAATTGTTTGGTTCTAACGAAGAGAGAGCTCGATGGAAAGATCATGTGGGCAGTAAGGATGAGAGACCCCAGGAGCAAAGGAATCTCTCAGCTGCAGTGGATATGAATATTAAGCATAGAAGAACTGAGTTGCCATACTCATCAGACGAGGATGCTGGAATTGATATGACCAATGACAACTACTCAGAAACATCGGGACAGTCGGACTTGAGGAGTCACGACGATGCGGGATATCAAGTCGCTCTGAGAAGCCGTCAGGGCAGACCGAGATATCCACCTCTATCAGTGCATAAACCTACGTCGTTTACATACCCATTGTCGGAATTGAATAATGAATGCACGGAAAATGAAATTCAAGAAAAATTAGAAGAGAAAGTGCTAGAAATCATTGATGCGCTTAGTGCTAGAGCTAACGCGGCTCGCGCTGCCGCTCTGGTTGCGCTACGTAATGCTTTACAAAGACGTTACCTGGCTCATTTATTGAGCGGACAGAGGGTCACACTAGCCGAACACATCACTAAGGCTTTGAGGAGAGGCAAAGACGGAGAGAAGAAGGCCGCGGCCGCTGTCGCACCGCTATTGGCTTTACAGATTGGTGAAGAAGGTACGGAAGAATTCATGTCTGAAGTCCGTCCGGCTCTGTTTGCTGCTGCCACTGACAAAACGGCCTCGCTAGACACTCGTACTGAGTGCTGTTCATCTCTGGCCGTACTCTGCTATCTGCTAGAAGAAGATCTCAATGAAATTTTAGAAGTAATGAAGATGTTTGAGACTATATTCAGCGGCAGCTACCTCAAGGGCGACGGCAGTGTGAAAATATCGGGAGCGGCGGTGGAGGAGGGGTCGTGGCACGCGGCCGCGCTGGACGGGTGGGCGCTACTGCTGGCGCTGCTGGACGGGCGACACGCCGCCGCCACGCTTCGCGAGCGCCCGCCCTCCTTCACCCGCCTGGCCGAGCTGCTGGACGCTTGCAGCCTGGACGTGCGCCTCGCGGCCGGCTGCGCCCTCGCCGCCGCGCACGAGCGGGCCGCGGACGGCGACGCGGACGGTCACGTGGCCTGGGACGAGCCCGCCGCCGCGCCGCGCCTGGCGCTGCTGGCGCGGGACTCGCACAAGTATCGCGCGAAGAGGGACCGTAAGCTGCAGCGCGCCACCTTCAGGGACATCCTCAAGTACTTCGAGGACGGCGAGATCGACGAGGAACGAGTCCGCGTGGGGGCGGAGACTCTGTGCGTGGACAGCTGGGCGGCGCGCGGCGCGTACTCGGCGCTGGCGGCGGCGCTCGGCGCCGGCCTGGCAGTGCTCGCGCCGCACGCGCCCGAGCTCCGGACCGCGCTGGGCCTGCCGCCGGCCGCGCCGCACGCCGCGCCTAGAGCTAAACT
CAACAAGCTGCAGAGACATCTCCAGAACACGGCGGCGTGTAAAGCTCGCACACTGGCCCGCAATAAGAGTCGTGACAAGCGCTCGGCGGCGCTAGCTCTGTGA

Protein sequence:

>DPOGS210089-PA
MLVTALFLLTSLAQARPFNITHLIVPEVIAPGQEEVEIECRYDANFTLLNWFKGPNEFFRYRPGAAPSTRSFPVLGVGRVELIICGPTACRLKLGSLTEEATGLYRCDIERDVPPYKFATRTAYMEVHGHEHRKPLLEGLDEEFGEGDDMQAYCRGDPETEIRWYINGRELEEMRGASSLKRKSSRLIFLGIPPMVTVQCAEYKFGKLFGSNEERARWKDHVGSKDERPQEQRNLSAAVDMNIKHRRTELPYSSDEDAGIDMTNDNYSETSGQSDLRSHDDAGYQVALRSRQGRPRYPPLSVHKPTSFTYPLSELNNECTENEIQEKLEEKVLEIIDALSARANAARAAALVALRNALQRRYLAHLLSGQRVTLAEHITKALRRGKDGEKKAAAAVAPLLALQIGEEGTEEFMSEVRPALFAAATDKTASLDTRTECCSSLAVLCYLLEEDLNEILEVMKMFETIFSGSYLKGDGSVKISGAAVEEGSWHAAALDGWALLLALLDGRHAAATLRERPPSFTRLAELLDACSLDVRLAAGCALAAAHERAADGDADGHVAWDEPAAAPRLALLARDSHKYRAKRDRKLQRATFRDILKYFEDGEIDEERVRVGAETLCVDSWAARGAYSALAAALGAGLAVLAPHAPELRTALGLPPAAPHAAPRAKLNKLQRHLQNTAACKARTLARNKSRDKRSAALAL-