Sometimes when your programming, you will come across a situation when you’ll we want to extract part of a text string but not all of the text string. Maybe it is in file names, column names or other text fields. It could be a number or just part of a date. Let’s see how we can accomplish this with these examples using the built-in Python split method.
In this example, I want to get the symbol of the stock from the string: AAPL-EarningsPerShare.
Here, I create a variable that takes the ‘AAPL’ part out of the phrase by using the built-in split method. I am specifying that I want to take this string ‘AAPL-EarningsPerShare‘,
I want to split the string by the dash (‘-‘), and finally I want to take the first part of the split [0]. If I wanted the second part of the split, I would use [1] at the end of this function.
Symbol = 'AAPL-EarningsPerShare'.split('-')[0]then display by using the print function.
print(Symbol)
output:
AAPL
Let’s get the second part of the string:
Symbol_Part2 = 'AAPL-EarningsPerShare'.split('-')[1]
print(Symbol_Part2)output:
EarningsPerShare
In this next example, we have a date like this: November_27_1942
Let’s get all three sections of this text string. We will need to split (‘_’) the string by underscore instead of a dash this time. And we will need to specify which part of the string we want with the ending location numbers of [0], [1] and [2].
Month = 'November_27_1942'.split('_')[0]
Day = 'November_27_1942'.split('_')[1]
Year = 'November_27_1942'.split('_')[2]
print(Month)
print(Day)
print(Year)output:
November 27 1942
You can see from the output, we got all three sections separated out. This can be a helpful function when dealing with groups of similar files or text fields. See the python documentation.
Article by Zachary Storella – See more programming posts on our Python Page
