Metadata-Version: 2.1
Name: pytorch-pyramid-pooling
Version: 0.0.6
Summary: A PyTorch implementation of Spatial and Temporal Pyramid Pooling
Home-page: https://github.com/eric-musa/pytorch-pyramid-pooling
Author: Eric Musa
Author-email: eric.musa17@gmail.com
License: UNKNOWN
Project-URL: Bug Tracker, https://github.com/eric-musa/pytorch-pyramid-pooling/issues
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
License-File: LICENSE

# Pyramid Pooling implemented in PyTorch
This Module implements Spatial Pyramid Pooling (SPP) and Temporal Pyramid Pooling (TPP) as described in different papers.

**NOTE** This repo has been forked from <a href="https://github.com/revidee/pytorch-pyramid-pooling">Revidee's original</a> (which seems to no longer be under maintenance) to publish on Pypi. I (Eric Musa) take no credit for writing the Pyramid Pooling code, but could not find any information on Revidee to provide proper credit.

# The purpose of this fork is to continue maintenance of this package and post on Pypi


![SPP-TPP Comparison](https://github.com/eric-musa/pytorch-pyramid-pooling/blob/master/comparison-spp-tpp.png "SPP-TPP Comparison")


Temporal Pyramid Pooling:
------
[Sudholt, Fink: Evaluating Word String Embeddings and LossFunctions for CNN-based Word Spotting](http://patrec.cs.tu-dortmund.de/pubs/papers/Sudholt2017-EWS.pdf "Sudholt, Fink: Evaluating Word String Embeddings and LossFunctions for CNN-based Word Spotting")

### Principle
Given an 2D input Tensor, Temporal Pyramid Pooling divides the input in **x** _stripes_ which **extend through the height** of the image and **width of roughly (input_width / x)**. These stripes are then each pooled with max- or avg-pooling to calculate the output.

### Animated Principle
![TPP Visualization](https://github.com/eric-musa/pytorch-pyramid-pooling/blob/master/pytorch-tpp-visual.gif "TPP Visualization")

Spatial Pyramid Pooling:
------
[He, et. al.: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition](https://arxiv.org/abs/1406.4729 "He et. al.: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition")

### Principle
Given an 2D input Tensor, Spatial Pyramid Pooling divides the input in **x²** _rectangles_ with **height of roughly (input_height / x)** and **width of roughly (input_width / x)**. These rectangles are then each pooled with max- or avg-pooling to calculate the output.



