Difference between revisions of "MongoDB QueryData"
Line 112: | Line 112: | ||
A number of operations exist for the aggregation pipeline, details of which can be found in the MongoDB manual: | A number of operations exist for the aggregation pipeline, details of which can be found in the MongoDB manual: | ||
− | https://docs.mongodb.com/manual/reference/operator/aggregation/ | + | https://docs.mongodb.com/manual/reference/operator/aggregation/ |
+ | |||
+ | array has various operators and the one we are interested in is '''$filter''' | ||
+ | |||
+ | This returns a subset of the array with only the elements that match the filter condition | ||
+ | |||
+ | <pre style:"color:blue"> | ||
+ | $filter has the following syntax: | ||
+ | { $filter: { | ||
+ | input: <array>, /* expression for the array */ | ||
+ | as: <string>, /* variable name for the element */ | ||
+ | cond: <expression> /* filter condition */ | ||
+ | } } | ||
+ | </pre> | ||
== Next Step == | == Next Step == | ||
[[MongoDB_Update|Updating]] the collection | [[MongoDB_Update|Updating]] the collection |
Revision as of 16:47, 9 November 2016
Main Page >> MongoDB >>MongoDB Workbook >> Querying Collections
Querying a collection
The find() function can be used to query the documents.
The format is:
db.collectionName.find(optional_query_criteria)
Where the query_criteria follows a pattern:
db.collectionName.find({fieldName: "value"})
Note:
- the criteria is enclosed in curly brackets: {}
- the value needs quotes if it is a string or date value
- quotes are optional for the fieldName, so long as they do not contain spaces
- if the fieldName refers to a nested document, the name must be in matching single or double quotes
Find all documents
For example, show all the data so far in the deptCollection:
db.deptCollection.find()
The data comes back messy. The pretty() function can be used to improve the layout::
db.deptCollection.find().pretty()
Find One document
To find just one document - department 10:
db.deptCollection.find({deptno:10}).pretty()
Finding an employee means using the array name too:
db.deptCollection.find({"employees.empno":7902}).pretty()
However, this does mean you get back all the employees in the department they were found in!
Since Version 2.2 MongoDB's new $elemMatch can be used with arrays to return only the first element matching the $elemMatch condition:
db.deptCollection.find({deptno:20}, { _id: 0, employees: {$elemMatch: {empno: 7902}}}).pretty()
$elemMatch limits the contents of the employees array to contain only the first element matching the $elemMatch condition.
This is akin to a SQL query:
SELECT * FROM Emp WHERE deptno=20 AND empno = 7902
_id is a unique value automatically generated by MongoDB (like a Primary Key, except it is unique for the whole database).
Using _id:0 suppresses the value, however to see it:
db.deptCollection.find({deptno:20}, { employees: {$elemMatch: {empno: 7902}}}).pretty()
More about _ids in the next section.
Find with Query Criteria
The query criteria can be as complex as that found in SQL.
To find all employees earning more than 2000 in department 10:
db.deptCollection.find({deptno:10}, { employees: {$elemMatch: {sal: { $gt: 2000}}}}).pretty()
Same again for department 20 and the managers:
db.deptCollection.find({deptno:20}, { employees: {$elemMatch: {sal: { $gt: 2000}, job: "MANAGER"}}}).pretty()
employees is an array, so $elemMatch only returns the first matching value. What if we try this instead:
db.deptCollection.find({ "employees.sal" : { $gt: 2000}}).pretty()
Things to note:
- This time employees.sal must be enclosed in matching single or double quotes.
- Comment on what the above query returns.
- If you examine the data carefully, if an element of an array is found to be true, then all the elements are returned, or one only. Is this good practice?
Find departments with no managers:
db.deptCollection.find({ "employees.job" : { $ne: "MANAGER"}}).pretty()
Aggregation Pipeline
So far find() either returns all the elements of an array, if one element matches the search criteria, or $elematch returns the first one found only. The latter is fine if there is only one to be found, but not so good if several items in the array should match the search criteria.
The aggregation pipeline is a framework for data aggregation modelled on the concept of data processing pipelines. What this means, is documents enter a multi-stage pipeline that transforms the documents into aggregated results.
This is similar to using GROUP BY in SQL, where you might aggregate the average grades of all students taking a module.
The MongoDB aggregation pipeline consists of stages and each stage transforms the documents as they pass through the pipeline.
We can use this to gather elements of our employees array to get the employees matching the query criteria only, rather than one, or everyone.
$filter
A number of operations exist for the aggregation pipeline, details of which can be found in the MongoDB manual:
https://docs.mongodb.com/manual/reference/operator/aggregation/
array has various operators and the one we are interested in is $filter
This returns a subset of the array with only the elements that match the filter condition
$filter has the following syntax: { $filter: { input: <array>, /* expression for the array */ as: <string>, /* variable name for the element */ cond: <expression> /* filter condition */ } }
Next Step
Updating the collection