Python Does What?!?: String optimization in Python

Thursday, May 5, 2016

String optimization in Python

Strings are terribly important in programming. A program without some form of string input, manipulation, and output is a rarity.

Of course, this means that the speed and sanity of string features are important. One key feature of Python is string immutability. This enables a number of useful properties, such as safely using strings as dictionary keys, but there are some downsides.

Immutable strings mean that any string manipulation, such as splitting or appending, makes a copy of the string's data. This can become a performance problem, especially in a world where zero-copy is one of the favorite general optimization techniques. If you've done enough string manipulation, you're probably aware of the following techniques:

''.join(iterable_of_strings) (instead of repeated +=)
bytearray()
StringIO()
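
As a rough illustration of the three techniques listed above, here is a minimal sketch. It assumes Python 3 (where StringIO lives in the io module), and the pieces list is made-up sample data, not anything from a real program:

```python
import io

# Hypothetical sample data for illustration.
pieces = ['spam', 'eggs', 'ham']

# 1. Collect the pieces and join once, instead of repeated +=.
joined = ''.join(pieces)

# 2. bytearray is a mutable byte buffer that can grow in place.
buf = bytearray()
for piece in pieces:
    buf += piece.encode('ascii')
from_bytearray = buf.decode('ascii')

# 3. StringIO is an in-memory text buffer with a file-like API.
out = io.StringIO()
for piece in pieces:
    out.write(piece)
from_stringio = out.getvalue()

# All three approaches build the same value.
assert joined == from_bytearray == from_stringio
```

Which one fits best depends on the workload: join suits text you already have as a sequence, bytearray suits binary data, and StringIO suits code that wants to write to something file-like.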

But in some cases Python uses the immutability to avoid making copies:

>>> a = 'a' * 1024 * 1024  # a 1 megabyte string
>>> z = '' + a
>>> z is a
True

Here, because adding an empty string does not change the value, z is the exact same string object as a. And it doesn't matter how many empty strings you prepend:

>>> z = '' + '' + '' + a
>>> z is a
True

It even works with join, when a is the only item in the list:

>>> z = ''.join([a])
>>> z is a
True

But it falls apart when you put an empty string in the list with a:

>>> z = ''.join(['', a])
>>> z is a
False

And unfortunately even the first example seems to make a copy on PyPy:

>>>> a = 'a' * 1024 * 1024  # a 1 megabyte string again
>>>> z = '' + a
>>>> z is a
False

Something more advanced may be going on under the covers, though, as is often the case with PyPy.
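
Since the results differ between interpreters, it can be handy to run all of these checks in one go. The following is only a sketch: identity_report is a hypothetical helper name, and the extra a + '' case is an addition that does not appear in the examples above. It deliberately lists no expected identity results, because those depend on the implementation and version:

```python
def identity_report(size=1024 * 1024):
    """Report which no-op string operations return the original object."""
    a = 'a' * size
    empty = ''
    candidates = {
        "'' + a": empty + a,
        "a + ''": a + empty,  # extra case, not shown in the post
        "''.join([a])": empty.join([a]),
        "''.join(['', a])": empty.join([empty, a]),
    }
    # Every candidate must be equal in value; only identity may vary.
    assert all(value == a for value in candidates.values())
    return {label: value is a for label, value in candidates.items()}


for label, same in identity_report().items():
    print(label, '->', 'same object' if same else 'new object')
```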

I'm almost done stringing you along, but as a corollary reminder:

Never rely on "is" checks with ints, floats, and strings. "==" and other value checks are what you need. As a general rule, "is" is for objects, None, and sometimes True/False.
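
To make that rule concrete, here is a small sketch. The helper names same_value and is_missing are made up for illustration:

```python
def same_value(x, y):
    """Compare by value, which is what string and number checks should use."""
    return x == y


def is_missing(x):
    """Identity is the right check for the None singleton."""
    return x is None


first = ''.join(['py', 'thon'])  # built at runtime
second = 'python'

assert same_value(first, second)        # equal values
assert same_value(10 ** 20, 10 ** 20)   # equal ints, whatever their identity
assert is_missing(None) and not is_missing('')
# Whether `first is second` is True depends on the interpreter, so don't rely on it.
```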

Keep on stringifying!

Mahmoud
http://sedimental.org/
https://github.com/mahmoud
https://twitter.com/mhashemi

Comments:

Unknown, May 5, 2016 at 3:11 PM
I think the CPython behaviour is an optimisation made possible by reference counting. When adding strings, CPython can tell whether it holds the only reference, and can reuse the memory rather than copying in cases like the above. PyPy doesn't use reference counting, so it can't do the same trick, as I understand it.

Matthias Wiesmann, May 6, 2016 at 11:03 PM
This could be related to interned strings: you can force any string to be unique and stored in a global table using the intern() function.