Site icon R-bloggers

Eight Advantages of Python Over Matlab

[This article was first published on R – Modern Data, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The original post is here. Dr. Phillip Feldman has kindly agreed to let us repost his thoughts on some key differences between Matlab and Python.

1. readability

Python code tends to be more compact and more readable than Matlab code. There are several reasons for this:

indentation

Unlike Matlab, which uses end statements as closures (i.e., to indicate the end of a block), Python determines the scope of a block based on its indentation. This forces Python programmers to indent code blocks, which is good practice anyway. (And, unlike programmers working in most other languages, no Python programmer ever wastes time hunting for a missing end statement).

[] vs ()

Like almost every programming language other than Matlab, Python uses square brackets for indexing and parentheses for function and method calls. Matlab uses parentheses for both. Python’s use of square brackets for indexing is important for readability, and also makes life easier for programmers who must work with multiple languages. Matlab’s use of parentheses for both indexing and function calls can make Matlab code hard to understand and is a frequent source of bugs.

augmented assignments

Support for augmented assignments tends to make Python code less verbose than Matlab code. Compare the following two fragments of code, which collect counts for a 2-D histogram: Matlab:
count(max(floor(x),1),max(floor(y),1))= …
count(max(floor(x),1),max(floor(y),1)) + 1
Python:
count[max(floor(x),0),max(floor(y),0)]+= 1
Note that the Matlab code is roughly twice as long. The chances of a typographical error are greater when entering this code, and it takes longer for someone to read and understand it. In general, Python code is almost always smaller (in terms of lines of code, assuming only one statement and no more than 80 characters per line) than Matlab code that performs the same task. For code that performs primarily numerical operations, one can expect a Python implementation to be 10 to 20 percent smaller than equivalent Matlab code. For programs that perform extensive string processing, the advantage of Python is even greater.

automatic broadcasting

Python’s support for automatic broadcasting allows for cleaner, more readable code. The following is a sample of Matlab code that performs element-wise subtraction and multiplication of arrays with broadcasting, where: A has dimensions 20 x 1 x 15, B is 20 x 12 x 1, and C is 1 x 12 x 15:
D= bsxfun(@times, bsxfun(@minus,B,C), A);
The corresponding Python/NumPy code below looks exactly like the equation that it implements:
D = A*(B-C)

literals

Python does not arbitrarily restrict the use of literals in expressions. Here is some sample input/output from an IPython session:
In[1]: n=3
In[2]: [1,2,4,7,11,14,16,17][n]
Out[2]: 7
Now try the same thing in Matlab:
>>n= 4 % Matlab indices start from 1
>>1,2,4,7,11,14,16,17
Error: Unbalanced or unexpected parenthesis or bracket
The better readability of Python is much more than a matter of aesthetics. Better readability leads to fewer bugs, faster debugging when bugs are introduced, and faster comprehension when one must work with someone else’s code (or with one’s own code after a long hiatus).

2. zero-based indexing

Like almost every programming language other than Matlab, Python uses zero-based indexing, i.e., if x is a one-dimensional array, the first element is x[0]—not x[1]. Because virtually every signal processing algorithm in the literature uses summations in which indices start at zero, Matlab’s one-based indexing forces implementers to translate the indices, which is both inconvenient and a source of bugs.

3. dictionaries

Python offers excellent support for dictionaries (hashes). These are like arrays, except that the index (key) is not constrained to be an integer. If one codes a compiler or interpreter, a dictionary is a natural and highly efficient way to implement a symbol table. Dictionaries also have many applications in engineering and scientific programming; their power and usefulness cannot be overstated.

4. object-oriented

Object-oriented programming (OOP) in Python is simple and elegant, and offers power/flexibility comparable to that of C++. Matlab’s object-oriented programming scheme is complex and confusing.

5. free and open-source

Python is free and open-source, which has many advantages:

contributing

Because Python is open-source, one can always look under the hood to see how things have been implemented. Much of Matlab, on the other hand, is closed and proprietary, and in some cases even the algorithms are proprietary. (Simulink, which is a simulation toolbox for use with Matlab, has a proprietary scheduling algorithm).

Cost

Matlab licenses are fairly expensive, and each Matlab toolbox is licensed individually. Python, on the other hand, is open-source and free. Creating one’s own build of Python with a reasonable set of libraries is a non-trivial exercise, but high-quality distributions are available that include NumPy, SciPy, IPython (an interactive shell), matplotlib, Pandas, and several other useful libraries.

community

The Python world is open in another sense: There is a vigorous, healthy public debate on virtually all aspects of Python, with input from any developer or user who wants to participate. [I have lobbied successfully for minor changes to NumPy, the interactive IPython shell, and the mapping component of matplotlib.] In stark constrast, the Matlab developers at the Mathworks make decisions behind closed doors and tend to be unreceptive or even hostile to suggestions from the outside.

6. script proliferation

As per Mathworks: “MATLAB program files can contain code for more than one function. The first function in the file (the main function) is visible to functions in other files, or you can call it from the command line. Additional functions within the file are called local functions. Local functions are only visible to other functions in the same file.” Because local functions are not accessible outside of the file in which they were defined, developers working on complex Matlab-based projects tend to be overwhelmed by a plethora of small files.

7. function imports

There is no Matlab counterpart to Python’s import statement. (Matlab searches the Matlab path—a list of folders—to resolve references to functions that are neither built-in nor defined within the same file). Also, when the name of a Matlab source code file differs from the name of the main (initial) function in that file, the file name overrides the function name appearing in the code. As a consequence of these two design flaws, one can’t have multiple versions of the same function unless they have different names or the Matlab path is continually changed. When a large Matlab application is divided among many files, the locations of these files are critical, and must be consistent with the Matlab path. In particular, Matlab’s OOP scheme is tied to the folder structure of the computer —an incredibly ill-conceived concept that makes it nearly impossible to maintain multiple versions of the same shared source code libraries. Following an upgrade of a shared library, one has three choices: Python allows one to organize classes and functions into modules and packages, with the module or package name being used to resolve any name conflicts. Python’s import command gives one precise control over what components are used by any program.

8. more graphics options

Python offers a wider set of choices in graphics packages and toolsets:

summary


Don’t forget to visit the original author Dr. Phillip M Feldman’s page !

To leave a comment for the author, please follow the link and comment on their blog: R – Modern Data.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.