Metadata-Version: 2.1
Name: pynetwork
Version: 2.1
Summary: package to manage connection swarms for your network workloads
Home-page: https://github.com/nirajkale/pynet
Author: Niraj S. Kale
Author-email: nirajkale157@outlook.com
License: UNKNOWN
Description:   # pynetwork
          ##  High performance socket swarms for network workloads
          
          ### Important Updates:
          
          Due to some name conflict in pypi, i had to change the name of the library from "pynet" to "pynetwork"
          
          ### Abstract
        
          pynet package is designed to help write scripts/projects that rely on interprocess communication over a network with minimal-to-no       knowledge of sockets. Furthermore these data transfer/steaming can be done in a swarm fashion, meaning multiple or swarm of persitant   tcp connections that serve a single objective to deliver high performance.
          Following objectives can be achieved using this package:
        
          1. Write subroutines/functions that can stream data over to your client using callbacks (No need of polling)
          2. Write subroutines/functions that can send batches of data over to your client
          3. Send data to your subroutines/function over network
          4. Send files to & from server using swarm of connections for higher transfer rates
          
          ### Installation
          
          I've added this package to pypi package index,so now you can install it using pip: <br/>
          
          pip install pynetwork
          
          ### Prelude
          
          As a fun project, i wanted a raspberry pi based RC car that can be controlled over wi-fi. On the receiver end, a combination of
          different deep neural network architectures that can consume the data from raspberry pi like camera feed ( to a CNN model), feed from
          various sensors e.g proximity, speed encoder etc ( to a RNN) & then can subsequently control it. But one of the keys problems that i
          faced while implementing it, was how to transfer all of this data back & forth between my laptop & pi in real time with low
          latency. This gave me an idea write a simple JSON based networking code that could transfer custom python objects back & forth, but
          that was not enough because i was dealing multiple data feeds & managing these multiple tcp connections was getting difficult. After
          some useless searches on github, i decided to buckle up and write down a networking library from scratch that could manage swarm of
          connections & also could help stream the data from python subroutines (These subroutines were pythonian generators that could
          endlessly read & stream the individual sensor data). Apart from subroutines based streaming, I also wanted to add batch fashioned
          suboutine data transfer (As you may have realized lot of these ideas sound similar to Keras generators that are used for feeding data
          to a machine learning model ). After a month of after-office coding & testing, I am finally releasing v2 of pynetwork. This version should
          not only be suitable for IOT but can be used for any network workload.
        
          ### Before jumping on to some use cases, below is some nomenclatures & components that you should be aware before using it:
          
          #### Gateway <--> Controller
          
          There are 4 main components involved namely: Gateway, Controller, Handler & a Client. Gateway acts as a small server which listens
          to requests made by Controller on a specified port. Gateway & Controller share a Many-to-Many relationship (One Gateway can respond
          to multiple Controllers & one Controller can connect to multiple Gateways). Gateway-controller themselves do not perform any workload
          operation (like data/file transfer) but instead they are responsible spawning & managing a swarm\s of tcp connections to handle 
          your workloads. Whenever a controller requests to a Gateway, Gateway spawns a Handler & sends an acknowledge to the Controller
          then in response, Controller creates a Client for that Handler. So 1 request results in spawning of a pair of "Handler <-> Client". 
          A handler & a client work in tandem to handle your custom workload. To improve speed of your operation, you can spawn multiple pairs
          of Handler & client. Each handler spwned at the Gateway runs on a independent thread (this is where the performance boost comes  
          from) & these handler threads are then managed by the Gateway. The number of possible Handlers<->Clients that can be spawned depends
          the size of Handler pool (Basically a custom thread pool). By default the pool size is 5 but you can increase it to any number
          depending on your hardware.
          
          **The swarm architecture looks something like this** 
          
          
          <img src="https://user-images.githubusercontent.com/40765055/58163266-53432900-7ca1-11e9-8c94-928eb364faf7.jpg" /></p>
        
          #### Handler <--> Client
          
          An individual pair of handler-client offer various options of data transfers:
          
          1. **Receive data stream from a handler subroutine**
          
                You can write python generator function that endlessly yield data (byte array) and register/add them to Gateway. Once added to
                Gateway, these subroutines are passed to any newly spawned handler. A client can then ask handler to stream data from this
                generator subroutine until it raises a StopIteration exception or returns None.
                Examples of such streaming could be:
                a. Streaming multiple sensor data
                b. Streaming key-presses to a key-logger over the network
                The data which is streamed from a generator is of byte array type data, along with this byte array data, function also needs to
                return an "id" (or idetity). This id can be perticualy useful when your using a swarm of connections to stream the data, because 
                if the same function is being used my multiple handlers then there is no gaurantee of the same order in which the data was 
                returned by the function would be preserved on the received end. In this case, you can use this id to re-arrange your data if 
                needed.
                
                **Steps to follow to use this functionality:**
                
          a. write a python 'generator' function that yeilds byte array data
             The return signature expected from this function is a 'python set' with 2 elements '(ident, buffer)'
             where ident is an integer which represents an index for this chunk from your steam & buffer is an byte array (which is data
             to be streamed).
        
          ```python
          import pynetwork.backend2 as bk
        
          def your_function_name(arg1, arg2,.., **kwargs): #parameters are optional
            index =1
            while condition:
              pass
              obj = your_logic_to_generate_data() #lets your data is a custom python obj
              #if its an object, then you can use your own custom json encode/decoder
              obj_str = your_json_encoder( obj)
              yield (index, bk.str_to_bytes(obj_to_str))
          ```
          
          b. you can use any of the byte conversion methods from backend2 module of the package. You can convert int, str, or JSON str,
             or objects (with default valued constructor) to byte array
        
          c. The streaming will terminate when the generator raises an 'StopIteration' error or ident<0 or buffer is 'None'
          d. you can also terminate streaming from client side by returning 'False' in your streaming callback (User requested stream
             abort)
        
          e. If streaming ends (without User requested stream abort) then client calls 'eos_callback'
        
          f. You can also pass positional arguments & kwargs to this generator when you start the stream. 
             (Package serializes your args & kwargs and then these params get deserialized at gateway where the function is executed 
             with same params)
                   
                
          2. **Receive data batch from a handler subroutine**
            
                If you need the data on transactional basis without the need of steams, you can opt for batch subroutines. These are simple 
                python function (which doesn't need to be written as a generator) that return byte data (with unique id for that batch), then 
                handler would steam this data across to client.
          
          3. **Send batch of data to handler subroutine**
          
                You can also send data to a subroutine whcih is registered with Gateway. This data is available as parameter called 
                **buffer** in your function's **kwargs** input. This is additional byte array input, apart from the usual arguments & kwargs 
                that you can pass to your subroutine. After the execution of subroutine, handler expects an integer return that is transferred
                back to the client as an output.
                
                **Steps to follow to use this functionality:**
                
          a. Write a function (with whatever positional/ kwargs you want) & regirster it with gateway
             package expects that this function would return an int which is then beamed back to client
        
          b. Since gateway passes an 'buffer' paramater through kwargs dictionary, this method needs to strictly follow below signatue:
        
          ```python
          def your_function_name(arg1, arg2,.., **kwargs):
            result = 2
            pass
            return result #return your integer result
          ```
          
          c. You can also pass positional arguments & kwargs to this generator when you start the stream. 
             (Package serializes your args & kwargs and then these params get deserialized at gateway where the function is executed 
             with same params)
                   
          
          4. **Download files from Gateway to client device (from folder name & regex or fullpath to files)**
          
                No need of description here apart from the fact that, you can speed up file transfer by distributing the load across multiple 
                connections. One important thing to note here is that, the speed improvement would be negligible if the files are less numerous 
                & are large in size in which case your bandwidth would be the limiting factor. but in case of numerous & small sized files, you
                might see a conisderable improvement.(So swarm could be perticaularly useful if you are transferring log files from you
                application where file count is usually high & size of each log is limted to few hundered MBs)
                *I would soon upload few stats to support above argument*
          
          5. **Send files to Gateway (files are downloaded to relative said folder)**
          
               Same as above, except the direction is opposite.
              
          6. **Ping Handler to check the connection**
          
              Could be useful to test the connection.
          
          Below are some of the use case tutorials using pynet with increasing complexity: 
        
        
          ## Tutorial 1: Conditional File Backup that uses batch data transfer
          
          Consider a scenario where you have a limited storage on a VM node or a server and would like to to move older files from this device
          to your local machine when the storage usage crosses 20% (starting from the older files). This objective can be achieve as below:
          
          #### On Gateway side
          ```python
        import pynetwork as pynet
        import pynetwork.backend2 as bk
        import struct
        import shutil
        import os
        
        def get_usage_percentage(request_id:int, **kwargs):
            '''
            Logic to calculate mount/overall hdd usage
            '''
            total, used, free = shutil.disk_usage("\\")
            usage_per = used/ total
            bk.safe_print('usage', usage_per) # safe_print make sure that output from multiple client/handler threads to console do not mix-up
            b = struct.pack('f', usage_per)    #convert the data bytes that needs to be sent back
            return b
        
        def remove_logs(dirpath:str, **kwargs):
            files = [os.path.join(dirpath,f) for f in os.listdir(dirpath)]
            for f in files: #iterate over files delete each one
                os.remove(f)
            bk.safe_print(len(files),' files deleted')
            return bk.int_to_bytes(len(files)) 
        
        if __name__ == '__main__':
        
            gw = pynet.Gateway(1857) #start gateway at port 1857
            gw.add_subroutine('mount_usage', get_usage_percentage)
            gw.add_subroutine('remove_logs', remove_logs)
            #add above two subroutines with a key, that client can pass to request execution
            gw.start(blocking = True) #start listening to controller
        ```
          
          #### On Controller side
          
          ```python
        import pynetwork as pynet
        import pynetwork.backend2 as bk
        import struct
        
        if __name__ == '__main__':
        
            controller = pynet.Controller(gateway_ip = '', port = 1857, download_dir = 'mydir', verbose = True) 
            client1 = controller.get_client()
            client1.ping('Hello there..')
            b = client1.get_subroutine_batch(name ='mount_usage',arguments=[123,])
            usage = struct.unpack('f', b)[0]
            bk.safe_print('mount usage:', usage)
            if usage > 0.2:
                bk.safe_print('downloading files..')
                client1.get_files_from_gateway(folder = 'D:\logs', regex = '.+RFIN703235761L') #using regex, u can filter files within a folder
                #in this case, we are only downloading log file belonging to tensorflow as a test
                count_bytes = client1.get_subroutine_batch(name ='remove_logs',arguments=['D:\logs',])
                bk.safe_print(bk.bytes_to_int(count_bytes),' files deleted') 
            else:
                pass
            client1.close_handler() #close handler so the handler pool at gateway will remain empty for others
            #additionally you can also stop Gatway by: client1.close_gateway()
        
        ```
          Additionaly, you can also add logic in 'remove_logs' function to remove files before a specific date & so on. Since you can integrate   any custom logic in your python subroutine, the possibilities are endless.
          
          #### Few things to be considered while writing a subroutine for receiving batch data (Handler -> Client):
          
          a. Function can have *arguments* & *kwargs* as an input, here arguments represent positional arguments (Against the usual python            convetion)
             *sorry for this mess, i will change the name in next release*
          b. The function need to return the data in byte format. So function can either one of the methoeds available in backed to convert int,
              str to byes (I would add support float soon, but meanwhile you can use struct package). Furthermore, if you want to return a 
              custom class then you use your own custom encode to serialize object to JSON string & then using backed to convert this string to
              bytes, then on the controller side do the opposite.
              **If you want to use the object serialization which available in backend then you can add your class to the "Handshakes file" to 
              use in-built serializtion, but it is important that you have primitive data-types in your class & have default values assigned to 
              all the input parameters in __init__ method of your class.**
             *I will make some change in serialization to give more freedom for class defination in next release*
             
          #### Few things to be considered while writing a subroutine for transmitting batch data (Client -> Handler):
          
          a. Function can have *arguments* & *kwargs* as an input, here arguments represent positional arguments (Against the usual python 
             convetion)
          b. The function need to return the integer result in byte format which will be beamed back to client as a result
          
        ## Tutorial 2: Raspberry-Pi control over wifi (uses both streaming & batch data transfer)
        
          *Work in progress.. Will post in 1-2 days!*
        
        
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Description-Content-Type: text/markdown
