Input & Output Adapters
With Qwak, you can customize the input and output formats of your ML models through the mechanism of input and output adapters.
The purpose of the adapters is to validate that the model inputs & outputs conform to the expected format and perform the corresponding type conversions.
Currently, the Qwak platform supports the following adapters:
Supported Adapters
Output Adapters
DataFrameOutputAdapter
DefaultOutputAdapter
AutodetectOutputAdapter
JsonOutputAdapter
ProtoOutputAdapter
TfTensorOutputAdapter
NumpyOutputAdapter
Input Adapters
DataframeInputAdapter
FileInputAdapter
ImageInputAdapter
JsonInputAdapter
ProtoInputAdapter
StringInputAdapter
TfTensorInputAdapter
NumpyInputAdapter
MultiInputAdapter
Adapter Types
Image
In the model file, we have to import the ImageInputAdapter
import numpy as np
import pandas as pd
from qwak.model.adapters import ImageInputAdapter
@qwak.api(analytics=False, input_adapter=ImageInputAdapter())
def predict(self, input_data) -> pd.DataFrame:
Now, in the predict
function, we get a list of arrays containing the RGB properties of the image pixels. If we pass a 28px x 28px image, we get an array with shape (28, 28, 3). Of course, if we pass a grayscale image, we will get a (28, 28, 1) array.
If you trained your model using grayscale pictures but pass RGB values in production, remember to convert the input to grayscale. For example like this:
@qwak.api(analytics=False, input_adapter=ImageInputAdapter())
def predict(self, input_data) -> pd.DataFrame:
def rgb2gray(rgb):
return np.dot(rgb[...,:3], [0.2989, 0.5870, 0.1140])
result = []
for image in input_data:
gray = rgb2gray(image)
gray = gray / 255.0
prediction_input = (np.expand_dims(gray, 0))
prediction = self.probability_model.predict(prediction_input)
result.append(prediction[0])
return pd.DataFrame(result)
File
We can pass the image as a file data stream and load it as a file inside the predict
function. Of course, sending a file works with any data format, not only images. However, in this example, we will use the same model as in the ImageInputAdapter example, but with a file adapter.
Before we start, we have to add the Pillow
library to the model dependencies and import the Image
class and the input adapter
import numpy as np
import pandas as pd
from PIL import Image
from qwak.model.adapters import FileInputAdapter
Now, we can change the input_adapter parameter in the qwak.api
decorator:
@qwak.api(analytics=False, input_adapter=FileInputAdapter())
def predict(self, file_streams) -> pd.DataFrame:
In the predict
function, we will iterate over the files in the file_stream
, load them as images, convert them to grayscale and resize them to the size required by the trained model. After that, we can pass the image data to the model and get the prediction.
result = []
for fs in file_streams:
im = Image.open(fs).convert(mode="L").resize((28, 28))
prediction_input = (np.expand_dims(im, 0))
prediction = self.probability_model.predict(prediction_input)
result.append(prediction[0])
return pd.DataFrame(result)
String
If we want to pass a single sentence to the ML model, we can use the StringInputAdapter
.
First, we have to import it
from qwak.model.adapters import StringInputAdapter
Now, we can configure the predict function to use the input adapter:
@qwak.api(analytics=False, input_adapter=StringInputAdapter())
def predict(self, texts) -> pd.DataFrame:
The texts
variable will contain a list of string values. We can iterate over it and pass them to the model.
For example, if we added the StringInputAdapter to our example Pytorch text classifier, it would look like this:
@qwak.api(analytics=False, input_adapter=StringInputAdapter())
def predict(self, texts) -> pd.DataFrame:
text_pipeline = lambda x: self.vocab(self.tokenizer(x))
responses = []
for text in texts:
with torch.no_grad():
text = torch.tensor(text_pipeline(text))
output = self.model(text, torch.tensor([0]))
responses.append(output.argmax(1).item() + 1)
return pd.DataFrame.from_dict({'label': responses, 'text': texts})
JSON
If you want to use your model in a front-end application, you will probably send JSON to the server. We can handle the JSON automatically by using the JsonInputAdapter
.
We must import the adapter first and configure the predict
function
from qwak.model.adapters import JsonInputAdapter
@qwak.api(analytics=False, input_adapter=JsonInputAdapter())
def predict(self, json_objects) -> pd.DataFrame:
Then, we can iterate over the json_objects and pass the text to the model:
@qwak.api(analytics=False, input_adapter=JsonInputAdapter())
def predict(self, json_objects) -> pd.DataFrame:
text_pipeline = lambda x: self.vocab(self.tokenizer(x))
responses = []
for json in json_objects:
with torch.no_grad():
text = torch.tensor(text_pipeline(json['text']))
output = self.model(text, torch.tensor([0]))
responses.append(output.argmax(1).item() + 1)
return pd.DataFrame.from_dict({'label': responses, 'text': json_objects})
To send a request to the deployed model, we must remember to specify the Content-Type: application/json
.
Proto
If you use the protobuf library in your software, you may also want to use it for communication with your ML model. It is common to use protobuf for both input and output formats, so our example will show both.
Let's assume that we have the following protobuf definition of the input data
syntax = "proto3";
package qwak.demo;
option java_multiple_files = true;
option java_package = "com.qwak.ai.demo";
message ModelInput {
int32 f1 = 1;
int32 f2 = 2;
}
and the output data:
syntax = "proto3";
package qwak.demo;
option java_multiple_files = true;
option java_package = "com.qwak.ai.demo";
message ModelOutput {
float prediction = 1;
}
We have to generate the protobuf classes for both the client application and the ML code. Of course, the ML code uses Python implementation. We will store the Python files in the input_pb
and the output_pb
files in the qwak_proto_demo
directory.
In the model class, we will have to import the protobuf class and the input adapter:
from qwak.model.adapters import ProtoInputAdapter, ProtoOutputAdapter
from .qwak_proto_demo.input_pb import ModelInput
from .qwak_proto_demo.output_pb import ModelOutput
In the next step, we configure the input and output adapter as a decorator of the predict
function:
@qwak.api(
analytics=False,
input_adapter=ProtoInputAdapter(ModelInput),
output_adapter=ProtoOutputAdapter(),
)
def predict(self, input) -> ModelOutput:
...
return ModelOutput(prediction=prediction_from_the_model)
In our implementation, we use the ParseFromString
function to read a protobuf message, so remember to serialize your classes using the SerializeToString
function!
message = ModelInput(f1=1, f2=2).SerializeToString()
TF Tensor
If we have all of the preprocessing code running as a separate service, we can pass a Tensorflow tensor directly to the model using a TfTensorInputAdapter
.
In this case, we import the adapter and configure the predict
function's decorator:
from qwak.model.adapters import TfTensorInputAdapter
@qwak.api(analytics=False, input_adapter=TfTensorInputAdapter())
def predict(self, tensor) -> pd.DataFrame:
To pass a tensor to a deployed model, we must send a JSON representation of the tensor. For example, if we used curl, the request would look like this:
curl -i –header "Content-Type: application/json" –request POST –data '{"instances": [1]}' qwak_rest_url
Multi Input
The MultiInputAdapter
supports Automatic input format detection.
Sometimes we want to deploy a single model with multiple different input adapters. We could create a copy of the model, change the input adapter, and deploy multiple copies. However, we can also use a MultiInputAdapter
to handle various input formats with a single model.
from qwak.model.adapters import DefaultOutputAdapter, DataframeInputAdapter, ImageInputAdapter, MultiInputAdapter
@qwak.api(
analytics=False,
input_adapter=MultiInputAdapter([ImageInputAdapter, DataframeInputAdapter]),
output_adapter=DefaultOutputAdapter(),
)
To use the MultiInputAdapter
adapter, we must pass a list of adapters to its constructor. The MultiInputAdapter
parses the data using the first compatible parser!
In our example, if a given input can be parsed as an Image, we will get an image in the predict function. If not, we will get a Pandas dataframe. If all parsers fail, the model returns an error.
Be careful with the following adapter configuration:
input_adapter=MultiInputAdapter([JsonInputAdapter, DataframeInputAdapter]),
The JsonInputAdapter
will successfully parse a JSON representation of a DataFrame!
Numpy
A NumpyInputAdapter
can automatically parse a JSON array as a Numpy array and reshape it to the desired structure. When we configure the NumpyInputAdapter
, we have to specify the content type and its shape:
from qwak.model.adapters import NumpyInputAdapter, NumpyOutputAdapter
@qwak.api(
analytics=False,
input_adapter=NumpyInputAdapter(
shape=(2, 2), enforce_shape=False, dtype="int32"
),
output_adapter=NumpyOutputAdapter(),
)
def predict(self, input):
If we configure the input adapter as in the example above, and send the following value to the model: [[5,4,3,2]]
, we will get a result equivalent to running np.array([[5, 4, 3, 2]], dtype=np.int32).reshape(2, 2)
.
The NumpyOutputAdapter
converts the returned output array directly to JSON without changing its structure. For example, if the model returns this Numpy array: np.array([[5, 4, 3, 2]], dtype=np.int32).reshape(2, 2)
, it will get converted to: [[5, 4], [3, 2]]
.
Starting from Sdk version 0.9.87 it will return numpy binary format .
Default Output
With DefaultOutputAdapter
we can return multiple result formats from a single model. The adapter will automatically detect the type of the returned value.
from qwak.model.adapters import DefaultOutputAdapter, ImageInputAdapter
@qwak.api(
analytics=False,
input_adapter=ImageInputAdapter(),
output_adapter=DefaultOutputAdapter(),
)
def predict(self, input):
Note that the DefaultOutputAdapter doesn't work with Protobuf objects! To automatically detect the output type when your code returns DataFrames, JSONs, and Protobuf objects, you need to use the AutodetectOutputAdapter
.
Json Output
With JsonOutputAdapter
we can return Dict
results, but the result has to be iterable
from qwak.model.adapters import ProtoInputAdapter, AutodetectOutputAdapter
@qwak.api(
analytics=False,
input_adapter=ProtoInputAdapter(ModelInput),
output_adapter=AutodetectOutputAdapter(),
)
def predict(self, df) -> JsonOutputAdapter:
...
return [{"result": ...}]
Auto Detect Output
Automatic output format detection with Protobuf support
This adapter works like the DefaultOutputAdapter
, but it can also handle Protobuf classes:
from qwak.model.adapters import ProtoInputAdapter, AutodetectOutputAdapter
@qwak.api(
analytics=False,
input_adapter=ProtoInputAdapter(ModelInput),
output_adapter=AutodetectOutputAdapter(),
)
def predict(self, df) -> ModelOutput:
...
return [ModelOutput(prediction=result)]
Data Frame Adapter
Pandas Dataframe is supported with requested orient. For Example:
from qwak.model.adapters import ProtoInputAdapter, AutodetectOutputAdapter
@qwak.api(
analytics=False,
input_adapter=DataframeInputAdapter(input_orient="split"),
output_adapter=DataFrameOutputAdapter(output_orient="records"),
)
def predict(self, df) -> ModelOutput:
...
return [ModelOutput(prediction=result)]
Note that the you can just choose the Data frame adapters with default value like below:
from qwak.model.adapters import ProtoInputAdapter, AutodetectOutputAdapter
@qwak.api(
analytics=False,
input_adapter=DataframeInputAdapter(),
output_adapter=DataFrameOutputAdapter(),
)
def predict(self, df) -> ModelOutput:
...
return [ModelOutput(prediction=result)]
In this case for the DataframeInputAdapter
will try to automatically recognize the type of the input. For DataFrameOutputAdapter
the output will be orient by records
Updated 11 months ago