7.6 KiB
3.4 Modules
This section introduces the concept of modules.
Modules and import
Any Python source file is a module.
# foo.py
def grok(a):
...
def spam(b):
...
The import statement loads and executes a module.
# program.py
import foo
a = foo.grok(2)
b = foo.spam('Hello')
...
Namespaces
A module is a collection of named values and is sometimes said to be a namespace. The names are all of the global variables and functions defined in the source file. After importing, the module name is used as a prefix. Hence the namespace.
import foo
a = foo.grok(2)
b = foo.spam('Hello')
...
The module name is tied to the file name (foo -> foo.py).
Global Definitions
Everything defined in the global scope is what populates the module
namespace. foo in our previous example. Consider two modules
that define the same variable x.
# foo.py
x = 42
def grok(a):
...
# bar.py
x = 37
def spam(a):
...
In this case, the x definitions refer to different variables. One
is foo.x and the other is bar.x. Different modules can use the
same names and those names won't conflict with each other.
Modules are isolated.
Modules as Environments
Modules form an enclosing environment for all of the code defined inside.
# foo.py
x = 42
def grok(a):
print(x)
Global variables are always bound to the enclosing module (same file). Each source file is its own little universe.
Module Execution
When a module is imported, all of the statements in the module execute one after another until the end of the file is reached. The contents of the module namespace are all of the global names that are still defined at the end of the execution process. If there are scripting statements that carry out tasks in the global scope (printing, creating files, etc.) you will see them run on import.
import as statement
You can change the name of a module as you import it:
import math as m
def rectangular(r, theta):
x = r * m.cos(theta)
y = r * m.sin(theta)
return x, y
It works the same as a normal import. It just renames the module in that one file.
from module import
This picks selected symbols out of a module and makes them available locally.
from math import sin, cos
def rectangular(r, theta):
x = r * cos(theta)
y = r * sin(theta)
return x, y
It allows parts of a module to be used without having to type the module prefix. Useful for frequently used names.
Comments on importing
Variations on import do not change the way that modules work.
import math as m
# vs
from math import cos, sin
...
Specifically, import always executes the entire file and modules
are still isolated environments.
The import module as statement is only manipulating the names.
Module Loading
Each module loads and executes only once. Note: Repeated imports just return a reference to the previously loaded module.
sys.modules is a dict of all loaded modules.
>>> import sys
>>> sys.modules.keys()
['copy_reg', '__main__', 'site', '__builtin__', 'encodings', 'encodings.encodings', 'posixpath', ...]
>>>
Locating Modules
Python consults a path list (sys.path) when looking for modules.
>>> import sys
>>> sys.path
[
'',
'/usr/local/lib/python36/python36.zip',
'/usr/local/lib/python36',
...
]
Current working directory is usually first.
Module Search Path
sys.path contains the search paths.
You can manually adjust if you need to.
import sys
sys.path.append('/project/foo/pyfiles')
Paths are also added via environment variables.
% env PYTHONPATH=/project/foo/pyfiles python3
Python 3.6.0 (default, Feb 3 2017, 05:53:21)
[GCC 4.2.1 Compatible Apple LLVM 8.0.0 (clang-800.0.38)]
>>> import sys
>>> sys.path
['','/project/foo/pyfiles', ...]
Exercises
For this exercise involving modules, it is critically important to make sure you are running Python in a proper environment. Modules are usually when programmers encounter problems with the current working directory or with Python's path settings.
(a) Module imports
In section 3, we created a general purpose function parse_csv() for parsing the contents of CSV datafiles.
Now, we’re going to see how to use that function in other programs. First, start in a new shell window. Navigate to the folder where you have all your files. We are going to import them.
Start Python interactive mode.
bash % python3
Python 3.6.1 (v3.6.1:69c0db5050, Mar 21 2017, 01:21:04)
[GCC 4.2.1 (Apple Inc. build 5666) (dot 3)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>>
Once you’ve done that, try importing some of the programs you previously wrote. You should see their output exactly as before. Just emphasize, importing a module runs its code.
>>> import bounce
... watch output ...
>>> import mortgage
... watch output ...
>>> import report
... watch output ...
>>>
If none of this works, you’re probably running Python in the wrong directory.
Now, try importing your fileparse module and getting some help on it.
>>> import fileparse
>>> help(fileparse)
... look at the output ...
>>> dir(fileparse)
... look at the output ...
>>>
Try using the module to read some data:
>>> portfolio = fileparse.parse_csv('Data/portfolio.csv',select=['name','shares','price'], types=[str,int,float])
>>> portfolio
... look at the output ...
>>> pricelist = fileparse.parse_csv('Data/prices.csv',types=[str,float], has_headers=False)
>>> pricelist
... look at the output ...
>>> prices = dict(pricelist)
>>> prices
... look at the output ...
>>> prices['IBM']
106.11
>>>
Try importing a function so that you don’t need to include the module name:
>>> from fileparse import parse_csv
>>> portfolio = parse_csv('Data/portfolio.csv', select=['name','shares','price'], types=[str,int,float])
>>> portfolio
... look at the output ...
>>>
(b) Using your library module
In section 2, you wrote a program report.py that produced a stock report like this:
Name Shares Price Change
---------- ---------- ---------- ----------
AA 100 39.91 7.71
IBM 50 106.11 15.01
CAT 150 78.58 -4.86
MSFT 200 30.47 -20.76
GE 95 37.38 -2.99
MSFT 50 30.47 -34.63
IBM 100 106.11 35.67
Take that program and modify it so that all of the input file
processing is done using functions in your fileparse module. To do
that, import fileparse as a module and change the read_portfolio()
and read_prices() functions to use the parse_csv() function.
Use the interactive example at the start of this exercise as a guide. Afterwards, you should get exactly the same output as before.
(c) Using more library imports
In section 1, you wrote a program pcost.py that read a portfolio and computed its cost.
>>> import pcost
>>> pcost.portfolio_cost('Data/portfolio.csv')
44671.15
>>>
Modify the pcost.py file so that it uses the report.read_portfolio() function.
Commentary
When you are done with this exercise, you should have three
programs. fileparse.py which contains a general purpose
parse_csv() function. report.py which produces a nice report, but
also contains read_portfolio() and read_prices() functions. And
finally, pcost.py which computes the portfolio cost, but makes use
of the code written for the report.py program.