Computer Vision Cloud API with over 1.5 billion requests processed

Bring Sighthound’s computer vision capabilities into your project with our detection, recognition, and demographics APIs for cloud-connected applications.

Detection API & Recognition API

Sighthound Cloud offers a Detection API for person, face, gender, age, and facial landmark detections; and a Recognition API that developers can use for face and vehicle recognition applications. Try out the following demo to see the Detection API and Recognition API in action.


Sighthound Cloud API Service

Sighthound Cloud is an API service that allows developers to add computer vision to their applications. Simple REST APIs return the location of any people and faces found in an image. Faces can be analyzed for gender, age, and facial landmark detections. Our landmark detector can find the various facial features in the detected faces, including eyes, nose and mouth, by fitting 68 landmark points to those features.

The detectors and recognizers are all based on state-of-the-art research from Sighthound’s Computer Vision Research Lab, and have higher rates of precision, recall and mean accuracy than other publicly available software.

Sighthound Cloud is free for developers and available for commercial purposes to those with a paid account. To get started, see below.

Quickstart Guides

Recognition API

To get started with the Recognition API refer to the API documentation.

Detection API

View the Detection API code examples below or refer to the API documentation for full details.

 Code Examples

var image = {image: ""};
var xmlhttp = new XMLHttpRequest();
var result;

xmlhttp.onreadystatechange = function () {
if (xmlhttp.readyState === 4 && xmlhttp.status === 200) {
result = xmlhttp.responseText;
}"POST", ",person&faceOption=landmark,gender");
xmlhttp.setRequestHeader("Content-type", "application/json");
xmlhttp.setRequestHeader("X-Access-Token", "YOUR-CLOUD-TOKEN");

 URL and Headers

POST type=face,person&faceOption=landmark,gender
Content-Type: application/json
X-Access-Token: Your-API-Key

Optional URL Parameters


A comma-separated list of object categories to detect. Valid options are ‘all’, ‘face’, ‘person’. Default is ‘all’.


For type 'face', additional detections can be performed by passing comma-separated list of values. Valid options are ‘gender’ and ‘landmark’. Default is face bounding box only.

Body Parameters


The image to analyze. This can be a URL to an image (authentication data in URL is accepted) or inline as base64 encoded data.


The result is a JSON array of all detected objects and information about the processed image.

{ "image": {
"width": 1280, "height": 960, "orientation": 1},
"objects": [
{"type": "person",
"boundingBox": { "x": 363, "y": 182, "height": 778, "width": 723} },
{"type": "face",
"boundingBox": {"x": 508, "y": 305, "height": 406, "width": 406,},
"attributes": {
"gender": "male", "genderConfidence": 0.9883, "frontal": true},
"landmarks": {
"faceContour": [[515,447],[517,491]...[872,436]],
"noseBridge": [[710,419],[711,441]...[712,487]],
"noseBall": [[680,519],[696,522]...[742,518]],
"eyebrowRight": [[736,387],[768,376]...854,394]],
"eyebrowLeft": [[555,413],[578,391]...679,391]],
"eyeRight": [[753,428],[774,414]...[777,432]],
"eyeRightCenter": [[786,423]],
"eyeLeft": [[597,435],[617,423]...[619,442]],
"eyeLeftCenter": [[630,432]],
"mouthOuter": [[650,590],[674,572]...[675,600]],
"mouthInner": [[661,587],[697,580]...[697,584]]}


An array of all detected objects. Each includes the type of detection results returned, face or person, and a boundingBox of the object's location in the image. The xywidth, and height values are defined in a coordinate space with (0,0) as the top left corner of the image.


The widthheight, and orientation of the processed image. Orientation defaults to 1, else the value found in the image's Exif data and indicates that bounding boxes have been translated to match that coordinate space.

Have questions? Need help?

Get Support

Want to talk about a project? Contact us!