Python: Need help splitting an input of binary code, with no spaces

Question

need to split every 8 char so it will become a list whereby i can then translate into ascii and then english. I just am at a loss for how to split the input,(one large string of binary numbers) into readable binary numbers, instead of just one string.

For example, the input string "010000010100001001000011" can be split in to octets as follows: "01000001","01000010","01000011".

What i have so far:

def main():
    import string

    #take user input of binary
    code = raw_input ('Please type in your binary code to be decoded: ')

    #split the code
    for word in code:
         print code[0::8] + ' '

    #replace the input with the variables
    ascii = ' '
    for word in code:
        ascii = ascii + int(word,2)

    english = ' '
    for word in acsii:
        english = english + chr(word)

    #print the variables to the user
    print english

#call Main
main()

Well, what's the problem? (Consider showing your output, and your expected output) — Arafangion
– Arafangion, Commented Feb 14, 2012 at 1:57

wim · Accepted Answer · 2012-02-14 02:17:58Z

5

This should help you some of the way, with list comprehensions:

>>> b = '010000010100001001000011'
>>> bin_chunks = [b[8*i:8*(i+1)] for i in xrange(len(b)//8)]
>>> print bin_chunks
['01000001', '01000010', '01000011']
>>> ints = [int(x, 2) for x in bin_chunks]
>>> print ints
[65, 66, 67]
>>> chars = [chr(x) for x in ints]
>>> print chars
['A', 'B', 'C']
>>> print ''.join(chars)
ABC

edited Feb 14, 2012 at 2:17

answered Feb 14, 2012 at 2:04

wim

368k114 gold badges681 silver badges816 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

user1208098 Over a year ago

okay thank you, now i'm starting to see how to set it up, thank you very much!

user1208098 Over a year ago

Okay this worked very well, thank you very much. everyone will be seeing a lot more of me, i'm very new to python.

Ignacio Vazquez-Abrams · Accepted Answer · 2012-02-14 02:03:30Z

2

>>> re.findall('[01]{8}', '010000010100001001000011')
['01000001', '01000010', '01000011']
>>> ''.join(chr(int(x, 2)) for x in re.findall('[01]{8}', '010000010100001001000011'))
'ABC'

answered Feb 14, 2012 at 2:03

Ignacio Vazquez-Abrams

804k160 gold badges1.4k silver badges1.4k bronze badges

2 Comments

user1208098 Over a year ago

thanks, you did it all in 4 lines that's cool, i'll look up more about this 'findall' seems useful.

Ignacio Vazquez-Abrams Over a year ago

@user1208098: One line, actually.

tito · Accepted Answer · 2012-02-14 02:21:25Z

2

I'm not sure that you want this, you maybe it worth it:

>>> s = "010000010100001001000011"
>>> [int(s[i:i+8], 2) for i in xrange(0, len(s), 8)]
[65, 66, 67]

But if you just want the '01' format:

>>> s = "010000010100001001000011"
>>> [s[i:i+8] for i in xrange(0, len(s), 8)]
['01000001', '01000010', '01000011']

thanks Ignacio, i must sleep.

edited Feb 14, 2012 at 2:21

answered Feb 14, 2012 at 2:04

tito

13.3k1 gold badge57 silver badges76 bronze badges

4 Comments

tito Over a year ago

It's an integer division operator, no need to remember?

senderle Over a year ago

@tito, look at the numbers at the bottom. Do they look right?

Ignacio Vazquez-Abrams Over a year ago

What are the offsets you're slicing at?

wim Over a year ago

@tito he means to tell you that you are slicing s[0:8], s[1:9], s[2:10] etc. when you should be slicing s[0:8], s[8:16], s[16:24] etc

Escualo · Accepted Answer · 2012-02-14 02:38:23Z

0

I am not sure I understand your question, but it seems that you are trying to do the following: Given an string of length 8n, convert each chunk of 8 binary digits into a (unicode) string then join the resulting string with no spaces.

If this is the case, then this will do the trick:

stream = "010000010100001001000011"
grouped = [stream[n:n+8] for n in range(len(stream)/8)]
characters = [unichr(int(c, 2)) for c in grouped]
result = u"".join(characters)
# returns u'A\x82\x05'

Edit: You mention "I want them in ASCII and then in English letters", then do the following:

ascii = [int(c, 2) for c in grouped] # this is a list of decimal ascii codes
english = [char(a) for a in ascii] # this is a list of characters - NOT UNICODE

but be careful, chr is only valid in range(256).

edited Feb 14, 2012 at 2:38

answered Feb 14, 2012 at 2:14

Escualo

42.5k26 gold badges95 silver badges139 bronze badges

1 Comment

user1208098 Over a year ago

all right except i don't want to join them at the end. what i'd like to do is have them separate, and be able to get them into ascii then to english letters. does that clear it up?

Collectives™ on Stack Overflow

Python: Need help splitting an input of binary code, with no spaces

4 Answers 4

2 Comments

2 Comments

4 Comments

1 Comment

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

2 Comments

2 Comments

4 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Related