获取 Jupyter Notebook 中定义的对象的源

And*_*ott 7 python inspect python-3.x jupyter-notebook

通常,如果你想获取一个对象的来源,你可以通过inspect模块获取:

import inspect
inspect.getsource(MyObject)
Run Code Online (Sandbox Code Playgroud)

但是,在 Jupyter 笔记本中,这不起作用:

import inspect

class Foo:
    def __init__(self, info):
        self.info = info

a = Foo("hi")

inspect.getsource(a)
Run Code Online (Sandbox Code Playgroud)

抛出错误:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-14-048b6f0c2e9b> in <module>()
      7 a = Foo("hi")
      8 
----> 9 inspect.getsource(a)

/usr/lib/python3.6/inspect.py in getsource(object)
    963     or code object.  The source code is returned as a single string.  An
    964     OSError is raised if the source code cannot be retrieved."""
--> 965     lines, lnum = getsourcelines(object)
    966     return ''.join(lines)
    967 

/usr/lib/python3.6/inspect.py in getsourcelines(object)
    950     raised if the source code cannot be retrieved."""
    951     object = unwrap(object)
--> 952     lines, lnum = findsource(object)
    953 
    954     if ismodule(object):

/usr/lib/python3.6/inspect.py in findsource(object)
    763     is raised if the source code cannot be retrieved."""
    764 
--> 765     file = getsourcefile(object)
    766     if file:
    767         # Invalidate cache if needed.

/usr/lib/python3.6/inspect.py in getsourcefile(object)
    679     Return None if no way can be identified to get the source.
    680     """
--> 681     filename = getfile(object)
    682     all_bytecode_suffixes = importlib.machinery.DEBUG_BYTECODE_SUFFIXES[:]
    683     all_bytecode_suffixes += importlib.machinery.OPTIMIZED_BYTECODE_SUFFIXES[:]

/usr/lib/python3.6/inspect.py in getfile(object)
    661         return object.co_filename
    662     raise TypeError('{!r} is not a module, class, method, '
--> 663                     'function, traceback, frame, or code object'.format(object))
    664 
    665 def getmodulename(path):

TypeError: <__main__.Foo object at 0x7fb9130ee518> is not a module, class, method, function, traceback, frame, or code object
Run Code Online (Sandbox Code Playgroud)

Foo如果我尝试找到(using )的来源inspect.getsource(Foo),我会得到:

TypeError: <module '__main__'> is a built-in class

如何获取 Jupyter Notebook 中定义的类的源代码?

小智 8

我找到了一种“黑客方式”来获取 Jupyter Notebook 中的类的源代码。

假设在一个单元格中有:

class MyClass:
    test = 2
    
    def __init__(self):
        self.L = 5
    
    def test(self, x):
        return True
    
    @classmethod
    def forward(cls, x):
        return x
Run Code Online (Sandbox Code Playgroud)

然后您可以使用以下命令提取代码:

import inspect
from IPython.core.magics.code import extract_symbols

obj = MyClass
cell_code = "".join(inspect.linecache.getlines(new_getfile(obj)))
class_code = extract_symbols(cell_code, obj.__name__)[0][0]
print(class_code)
Run Code Online (Sandbox Code Playgroud)

new_getfile这里定义:

import inspect, sys

def new_getfile(object, _old_getfile=inspect.getfile):
    if not inspect.isclass(object):
        return _old_getfile(object)
    
    # Lookup by parent module (as in current inspect)
    if hasattr(object, '__module__'):
        object_ = sys.modules.get(object.__module__)
        if hasattr(object_, '__file__'):
            return object_.__file__
    
    # If parent module is __main__, lookup by methods (NEW)
    for name, member in inspect.getmembers(object):
        if inspect.isfunction(member) and object.__qualname__ + '.' + member.__name__ == member.__qualname__:
            return inspect.getfile(member)
    else:
        raise TypeError('Source for {!r} not found'.format(object))
inspect.getfile = new_getfile
Run Code Online (Sandbox Code Playgroud)


Kev*_* He 1

使用inspect.getsource(inspect.getfile)我们可以获得处理此问题的一段代码:

...
if isclass(object):
    if hasattr(object, '__module__'):
        object = sys.modules.get(object.__module__)
        if hasattr(object, '__file__'):
            return object.__file__
    raise TypeError('{!r} is a built-in class'.format(object))
...
Run Code Online (Sandbox Code Playgroud)

在 ipython 或 Jupyter 笔记本中,定义的类/函数或__main__模块似乎没有__file__与其关联的属性,因此inspect无法检索源文件。.py在这种情况下,您可以在单独的文件中定义类,以便inspect能够检索与其关联的文件。