构建跟踪接收器
如果你正在阅读本教程,你可能已经了解了分布式跟踪背后的 OpenTelemetry 概念,但如 果你还没有,你可以快速阅读它在这里.
以下是 OpenTelemetry 对这些概念的定义:
跟踪跟踪单个请求的进程,称为跟踪,因为它是由组成应用程序的服务处理的。请求可以 由用户或应用程序发起。分布式跟踪是一种跨越进程、网络和安全边界的跟踪形式。
尽管这个定义看起来非常以应用程序为中心,但是您可以利用 OpenTelemetry 跟踪模型来 表示请求,并快速了解请求的持续时间和完成请求所涉及的每个步骤的详细信息。
假设您已经有了一个生成某种跟踪遥测的系统,那么OpenTelemetry Collector就是帮助您将其用于 OTel 世界的门户。
在 Collector 中,跟踪接收器的作用是接收您的遥测请求,并将其从原始格式转换为 OTel 跟踪模型,以便通过 Collector 的管道正确处理信息。
为了实现跟踪接收器,您将需要以下内容:
- 一个
Config
实现,使跟踪接收器能够在收集器的config.yaml
文件中收集和验证其配 置。 - 一个
receiver.Factory
实现,以便收集器可以正确地实例化跟踪接收器组件。 - 一个
TracesReceiver
实现,负责收集遥测信息,将其转换为内部跟踪表示,并将信息传 递给管道中的下一个消费者。
在本教程中,我们将创建一个名为tailtracer
的示例跟踪接收器,它模拟拉操作并生成跟
踪作为该操作的结果。接下来的部分将指导您完成实现上述步骤的过程,以便创建接收器,
所以让我们开始吧。
设置接收器开发和测试环境¶
First use the Building a Custom Collector
tutorial to create a Collector instance named otelcol-dev
; all you need is to
copy the builder-config.yaml
described on Step 2 and make the following
changes:
As an outcome you should now have a otelcol-dev
folder with your Collector's
development instance ready to go.
In order to properly test your trace receiver, you will need a distributed
tracing backend so the Collector can send the telemetry to it. We will be using
Jaeger, if you
don't have a Jaeger
instance running, you can easily start one using Docker
with the following command:
Now, create a config.yaml
file so you can set up your Collector's components.
For now, you just need a basic traces pipeline with the otlp
receiver, the
jaeger
and logging
exporters, here is what your config.yaml
file should
look like:
config.yaml
Notice that I am only using the insecure
flag in my jaeger
receiver config
to make my local development setup easier; you should not use this flag when
running your collector in production.
In order to verify that your initial pipeline is properly set up, you should
have the following output after running your otelcol-dev
command:
Make sure you see the last line, that will confirm that the Jaeger exporter has successfully established a connection to your local Jaeger instance. Now that we have our environment ready, let's start writing your receiver's code.
Now, create another folder called tailtracer
so we can have a place to host
all of our receiver code.
Every Collector's component should be created as a Go module, so you will need
to properly initialize the tailtracer
module. In my case here is what the
command looked like:
读取和验证您的接收器设置¶
In order to be instantiated and participate in pipelines the Collector needs to identify your receiver and properly load its settings from within its configuration file.
The tailtracer
receiver will have the following settings:
interval
: a string representing the time interval (in minutes) between telemetry pull operationsnumber_of_traces
: the number of mock traces generated for each interval
Here is what the tailtracer
receiver settings will look like:
Under the tailtracer
folder, create a file named config.go
where you will
write all the code to support your receiver settings.
To implement the configuration aspects of a receiver you need create a Config
struct. Add the following code to your config.go
file:
In order to be able to give your receiver access to its settings the Config
struct must have a field for each of the receiver's settings.
Here is what your config.go
file should look like after you implemented the
requirements above.
config.go
检查你的工作
- I added the
Interval
and theNumberOfTraces
fields so I can properly have access to their values from the config.yaml.
Now that you have access to the settings, you can provide any kind of validation
needed for those values by implementing the Validate
method according to the
optional
ConfigValidator interface.
In this case, the interval
value will be optional (we will look at generating
default values later) but when defined should be at least 1 minute (1m) and the
number_of_traces
will be a required value. Here is what the config.go looks
like after implementing the Validate
method.
config.go
检查你的工作
- I imported the
fmt
package, so I can properly format print my error messages.- I added the
Validate
method to my Config struct where I am checking if theinterval
setting value is at least 1 minute (1m) and if thenumber_of_traces
setting value is greater or equal to 1. If that is not true the Collector will generate an error during its startup process and display the message accordingly.
If you want to take a closer look at the structs and interfaces involved in the configuration aspects of a component, take a look at the component/config.go file inside the Collector's GitHub project.
使收集器实例化您的接收器¶
At the beginning of this tutorial, you created your otelcol-dev
instance,
which is bootstrapped with the following components:
- Receivers: OTLP Receiver
- Processors: Batch Processor
- Exporters: Logging and Jaeger Exporters
Go ahead and open the components.go
file under the otelcol-dev
folder, and
let's take a look at the components()
function.
As you can see, the components()
function is responsible to provide the
Collector the factories for all its components which is represented by a
variable called factories
of type otelcol.Factories
(here is the declaration
of the
otelcol.Factories struct), which will then be
used to instantiate the components that are configured and consumed by the
Collector's pipelines.
Notice that factories.Receivers
is the field holding a map to all the receiver
factories (instances of receiver.Factory
), and it currently has the
otlpreceiver
factory only which is instantiated through the
otlpreceiver.NewFactory()
function call.
The tailtracer
receiver has to provide a receiver.Factory
implementation,
and although you will find a receiver.Factory
interface (you can find its
definition in the
receiver/receiver.go file within the Collector's
project ), the right way to provide the implementation is by using the functions
available within the go.opentelemetry.io/collector/receiver
package.
实现你的 receiver.Factory¶
Start by creating a file named factory.go within the tailtracer
folder.
Now let's follow the convention and add a function named NewFactory()
that
will be responsible to instantiate the tailtracer
factory. Go ahead the add
the following code to your factory.go
file:
In order to instantiate your tailtracer
receiver factory, you will use the
following function from the receiver
package:
The receiver.NewFactory()
instantiates and returns a receiver.Factory
and it
requires the following parameters:
-
component.Type
: a unique string identifier for your receiver across all Collector's components. -
component.CreateDefaultConfigFunc
: a reference to a function that returns the component.Config instance for your receiver. -
...FactoryOption
: the slice ofreceiver.FactoryOption
s that will determine what type of signal your receiver is capable of processing.
Let's now implement the code to support all the parameters required by
receiver.NewFactory()
.
识别并提供接收方的默认设置¶
Previously, we said that the interval
setting for our tailtracer
receiver
would be optional, in that case you will need to provide a default value for it
so it can be used as part of the default settings.
Go ahead and add the following code to your factory.go
file:
As for default settings, you just need to add a function that returns a
component.Config
holding the default configurations for the tailtracer
receiver.
To accomplish that, go ahead and add the following code to your factory.go
file:
After these two changes you will notice a few imports are missing, so here is
what your factory.go
file should look like with the proper imports:
factory.go
检查你的工作
- Importing the
time
package in order to support the time.Duration type for the defaultInterval- Importing the
go.opentelemetry.io/collector/component
package, which is wherecomponent.Config
is declared.- Importing the
go.opentelemetry.io/collector/receiver
package, which is wherereceiver.Factory
is declared.- Added a
time.Duration
constant calleddefaultInterval
to represent the default value for our receiver'sInterval
setting. We will be setting the default value for 1 minute hence the assignment of1 * time.Minute
as its value.- Added a function called
createDefaultConfig
which is responsible to return a component.Config implementation, which in this case is going to be an instance of ourtailtracer.Config
struct.- The
tailtracer.Config.Interval
field was initialized with thedefaultInterval
constant.
使工厂能够将接收器描述为能够处理跟踪¶
The same receiver component can process traces, metrics, and logs. The receiver's factory is responsible for describing those capabilities.
Given that traces are the subject of the tutorial, that's the only signal we
will enable the tailtracer
receiver to work with. The receiver
package
provides the following function and type to help the factory describe the trace
processing capabilities:
The receiver.WithTraces()
instantiates and returns a receiver.FactoryOption
and it requires the following parameters:
createTracesReceiver
: A reference to a function that matches thereceiver.CreateTracesFunc
type
The receiver.CreateTracesFunc
type is a pointer to a function that is
responsible to instantiate and return a receiver.Traces
instance and it
requires the following parameters:
context.Context
: the reference to the Collector'scontext.Context
so your trace receiver can properly manage its execution context.receiver.CreateSettings
: the reference to some of the Collector's settings under which your receiver is created.component.Config
: the reference for the receiver config settings passed by the Collector to the factory so it can properly read its settings from the Collector config.consumer.Traces
: the reference to the nextconsumer.Traces
in the pipeline, which is where received traces will go. This is either a processor or an exporter.
Start by adding the bootstrap code to properly implement the
receiver.CreateTracesFunc
function pointer. Go ahead and add the following
code to your factory.go
file:
You now have all the necessary components to successfully instantiate your
receiver factory using the receiver.NewFactory
function. Go ahead and and
update your NewFactory()
function in your factory.go
file as follow:
After these two changes you will notice a few imports are missing, so here is
what your factory.go
file should look like with the proper imports:
factory.go
检查你的工作
- Importing the
context
package in order to support thecontext.Context
type referenced in thecreateTracesReceiver
function- Importing the
go.opentelemetry.io/collector/consumer
package in order to support theconsumer.Traces
type referenced in thecreateTracesReceiver
function- Updated the
NewFactory()
function so it returns thereceiver.Factory
generated by thereceiver.NewFactory()
call with the required parameters. The generated receiver factory will be capable of processing traces through the call toreceiver.WithTraces(createTracesReceiver, component.StabilityLevelAlpha)
At this point, you have the tailtracer
factory and config code needed for the
Collector to validate the tailtracer
receiver settings if they are defined
within the config.yaml
. You just need to add it to the Collector's
initialization process.
将接收器工厂添加到收集器的初始化中¶
As explained before, all the Collector components are instantiated by the
components()
function within the components.go
file.
The tailtracer
receiver factory instance has to be added to the factories
map so the Collector can load it properly as part of its initialization process.
Here is what the components.go
file looks like after making the changes to
support that:
components.go
检查你的工作
- Importing the
github.com/rquedas/otel4devs/collector/receiver/trace-receiver/tailtracer
module which is where the receiver types and function are.- Added a call to
tailtracer.NewFactory()
as a parameter of thereceiver.MakeFactoryMap()
call so yourtailtracer
receiver factory is properly added to thefactories
map.
We added the tailtracer
receiver settings to the config.yaml
previously, so
here is what the beginning of the output for running your Collector with
otelcol-dev
command should look like after building it with the current
codebase:
Look for the log line for "builder/receivers_builder.go:111" (it's the 4th line
from the bottom at the snippet showed here), you can see that the Collector
found the settings for the tailtracer
receiver, validated them (the current
settings are all correct), but ignores the receiver given that it's not used in
any pipeline.
Let's check if the tailtracer
factory is validating the receiver settings
correctly, the interval
setting isn't required, so if you remove it from the
config.yaml
and run the command again you should get the same output.
Now, let's test one of the tailtracer
settings validation rules. Remove the
number_of_traces
setting from the config.yaml
, and here is what the output
for running the Collector will look like:
The tailtracer
receiver factory and config requirements are done and the
Collector is properly loading your component. You can now move to the core of
your receiver, the implementation of the component itself.
实现跟踪接收器组件¶
In the previous section, I mentioned the fact that a receiver can process any of the OpenTelemetry signals, and the Collector's API is designed to help you accomplish that.
All the receiver APIs responsible to enable the signals are currently declared in the receiver/receiver.go file within the OTel Collector's project in GitHub, open the file and take a minute to browse through all the interfaces declared in it.
Notice that receiver.Traces
(and its siblings receiver.Metrics
and
receiver.Logs
) at this point in time, doesn't describe any specific methods
other than the ones it "inherits" from component.Component
.
It might feel weird, but remember, the Collector's API was meant to be extensible, and the components and their signals might evolve in different ways, so the role of those interfaces exist to help support that.
So, to create a receiver.Traces
, you just need to implement the following
methods described by component.Component
interface:
Both methods actually act as event handlers used by the Collector to communicate with its components as part of their lifecycle.
The Start()
represents a signal of the Collector telling the component to
start its processing. As part of the event, the Collector will pass the
following information:
context.Context
: Most of the time, a receiver will be processing a long-running operation, so the recommendation is to ignore this context and actually create a new one from context.Background().Host
: The host is meant to enable the receiver to communicate with the Collector's host once it's up and running.
The Shutdown()
represents a signal of the Collector telling the component that
the service is getting shutdown and as such the component should stop its
processing and make all the necessary cleanup work required:
context.Context
: the context passed by the Collector as part of the shutdown operation.
You will start the implementation by creating a new file called
trace-receiver.go
within your project's tailtracer
folder and add the
declaration to a type type called tailtracerReceiver
as follow:
Now that you have the tailtracerReceiver
type you can implement the Start()
and Shutdown() methods so the receiver type can be compliant with the
receiver.Traces
interface.
Here is what the tailtracer/trace-receiver.go
file should look like with the
methods implementation:
trace-receiver.go
检查你的工作
- Importing the
context
package which is where theContext
type and functions are declared- Importing the
go.opentelemetry.io/collector/component
package which is where theHost
type is declared- Added a bootstrap implementation of the
Start(ctx context.Context, host component.Host)
method to comply with thereceiver.Traces
interface.- Added a bootstrap implementation of the
Shutdown(ctx context.Context)
method to comply with thereceiver.Traces
interface.
The Start()
method is passing 2 references (context.Context
and
component.Host
) that your receiver might need to keep so they can be used as
part of its processing operations.
The context.Context
reference should be used for creating a new context to
support you receiver processing operations, and in that case you will need to
decide the best way to handle context cancellation so you can finalize it
properly as part of the component's shutdown within the Shutdown()
method.
The component.Host
can be useful during the whole lifecycle of the receiver so
you should keep that reference within your tailtracerReceiver
type.
Here is what the tailtracerReceiver
type declaration will look like after you
include the fields for keeping the references suggested above:
Now you need to update the Start()
methods so the receiver can properly
initialize its own processing context and have the cancellation function kept in
the cancel
field and also initialize its host
field value. You will also
update the Stop()
method in order to finalize the context by calling the
cancel
function.
Here is what the trace-receiver.go
file look like after making the changes
above:
trace-receiver.go
检查你的工作
- Updated the
Start()
method by adding the initialization to thehost
field with thecomponent.Host
reference passed by the Collector and thecancel
function field with the cancellation based on a new context created withcontext.Background()
(according the Collector's API documentation suggestions).- Updated the
Stop()
method by adding a call to thecancel()
context cancellation function.
保存由接收方工厂传递的信息¶
Now that you have implemented the receiver.Traces
interface methods, your
tailtracer
receiver component is ready to be instantiated and returned by its
factory.
Open the tailtracer/factory.go
file and navigate to the
createTracesReceiver()
function. Notice that the factory will pass references
as part of the createTracesReceiver()
function parameters that your receiver
actually requires to work properly like its configuration settings
(component.Config
), the next Consumer
in the pipeline that will consume the
generated traces (consumer.Traces
) and the Collector's logger so the
tailtracer
receiver can add meaningful events to it
(receiver.CreateSettings
).
Given that all this information will be only be made available to the receiver
at the moment its instantiated by the factory, The tailtracerReceiver
type
will need fields to keep that information and use it within other stages of its
lifecycle.
Here is what the trace-receiver.go
file looks like with the updated
tailtracerReceiver
type declaration:
trace-receiver.go
检查你的工作
- Importing the
go.opentelemetry.io/collector/consumer
which is where the pipeline's consumer types and interfaces are declared.- Importing the
go.uber.org/zap
package, which is what the Collector uses for its logging capabilities.- Added a
zap.Logger
field namedlogger
so we can have access to the Collector's logger reference from within the receiver.- Added a
consumer.Traces
field namednextConsumer
so we can push the traces generated by thetailtracer
receiver to the next consumer declared in the Collector's pipeline.- Added a
Config
field namedconfig
so we can have access to receiver's configuration settings defined within the Collector's config.- Added a variable named
interval
that will be initialized as atime.Duration
based on the value of theinterval
settings of thetailtracer
receiver defined within the Collector's config.- Added a
go func()
to implement theticker
mechanism so our receiver can generate traces every time theticker
reaches the amount of time specified by theinterval
variable and used thetailtracerRcvr.logger
field to generate a info message every time the receiver supposed to be generating traces.
The tailtracerReceiver
type is now ready to be instantiated and keep all
meaningful information passed by its factory.
Open the tailtracer/factory.go
file and navigate to the
createTracesReceiver()
function.
The receiver is only instantiated if it's declared as a component within a pipeline and the factory is responsible to make sure the next consumer (either a processor or exporter) in the pipeline is valid otherwise it should generate an error.
The Collector's API provides some standard error types to help the factory
handle pipeline configurations. Your receiver factory should throw a
component.ErrNilNextConsumer
in case the next consumer has an issue and is
passed as nil.
The createTracesReceiver()
function will need a guard clause to make that
validation.
You will also need variables to properly initialize the config
and the
logger
fields of the tailtracerReceiver
instance.
Here is what the factory.go
file looks like with the updated
createTracesReceiver()
function:
factory.go
检查你的工作
- Added a guard clause that verifies if the consumer is properly instantiated and if not returns the
component.ErrNilNextConsumer
error.- Added a variable called
logger
and initialized it with the Collector's logger that is available as a field namedLogger
within thereceiver.CreateSettings
reference.- Added a variable called
tailtracerCfg
and initialized it by casting thecomponent.Config
reference to thetailtracer
receiverConfig
.- Added a variable called
traceRcvr
and initialized it with thetailtracerReceiver
instance using the factory information stored within the variables.- Updated the return statement to now include the
traceRcvr
instance.
With the factory fully implemented and instantiating the trace receiver
component you are ready to test the receiver as part of a pipeline. Go ahead and
add the tailtracer
receiver to your traces
pipeline in the config.yaml
as
follow:
Here is what the output for running your Collector with otelcol-dev
command
should look like after you updated the traces
pipeline:
Look for the log line for "builder/receivers_builder.go:68 Receiver is
starting... {"kind": "receiver", "name": "tailtracer"}", you can see that the
Collector found the settings for the tailtracer
receiver within the traces
pipeline and is now instantiating it and starting it given that 1 minute after
the Collector has started, you can see the info line we added to the ticker
function within the Start()
method.
Now, go ahead and press Ctrl + C in your Collector's terminal so you want watch the shutdown process happening. Here is what the output should look like:
As you can see there is an info log line for the tailtracer
receiver which
means the component is responding correctly to the Shutdown()
event. In the
next section you will learn more about the OpenTelemetry Trace data model so the
tailtracer
receiver can finally generate traces!
收集器的跟踪数据模型¶
You might be familiar with OpenTelemetry traces by using the SDKs and instrumenting an application so you can see and evaluate your traces within a distributed tracing backend like Jaeger.
Here is what a trace looks like in Jaeger:
Granted, this is a Jaeger trace, but it was generated by a trace pipeline within the Collector, therefore you can use it to learn a few things about the OTel trace data model :
- A trace is made of one or multiple spans structured within a hierarchy to represent dependencies.
- The spans can represent operations within a service and/or across services.
Creating a trace within the trace receiver will be slightly different than the way you would do it with the SDKs, so let's start reviewing the high level concepts.
使用资源¶
In the OTel world, all telemetry is generated by a Resource
, here is the
definition according to the OTel spec:
A
Resource
is an immutable representation of the entity producing telemetry as Attributes. For example, a process producing telemetry that is running in a container on Kubernetes has a Pod name, it is in a namespace and possibly is part of a Deployment which also has a name. All three of these attributes can be included in theResource
.
Traces are most commonly used to represent a service request (the Services
entity described by Jaeger's model), which are normally implemented as processes
running in a compute unit, but OTel's API approach to describe a Resource
through attributes is flexible enough to represent any entity that you may
require like ATMs, IoT sensors, the sky is the limit.
So it's safe to say that for a trace to exist, a Resource
will have to start
it.
In this tutorial we will simulate a system that has telemetry that demonstrate
ATMs located in 2 different states (eg: Illinois and California) accessing the
Account's backend system to execute balance, deposit and withdraw operations,
therefore we will have to implement code to create the Resource
types
representing the ATM and the backend system.
Go ahead and create a file named model.go
inside the tailtracer
folder
Now, within the model.go
file, add the definition for the Atm
and the
BackendSystem
types as follow:
model.go
These types are meant to represent the entities as they are within the system
being observed and they contain information that would be quite meaningful to be
added to the traces as part of the Resource
definition. You will add some
helper functions to generate the instances of those types.
Here is what the model.go
file will look with the helper functions:
model.go
检查你的工作
- Imported the
math/rand
andtime
packages to support the implementation of thegenerateRandomNumber
function- Added the
generateAtm
function that instantiates anAtm
type and randomly assign either Illinois or California as values forStateID
and the equivalent value forISPNetwork
- Added the
generateBackendSystem
function that instantiates aBackendSystem
type and randomly assign service endpoint values for theEndpoint
field- Added the
generateRandomNumber
function to help generating random numbers between a desired range.
Now that you have the functions to generate object instances representing the entities generating telemetry, you are ready to represent those entities in the OTel Collector world.
The Collector's API provides a package named ptrace
(nested under the pdata
package) with all the types, interfaces and helper functions required to work
with traces within the Collector's pipeline components.
Open the tailtracer/model.go
file and add
go.opentelemetry.io/collector/pdata/ptrace
to the import
clause so you can
have access to the ptrace
package capabilities.
Before you can define a Resource
, you need to create a ptrace.Traces
that
will be responsible to propagate the traces through the Collector's pipeline and
you can use the helper function ptrace.NewTraces()
to instantiate it. You will
also need to create instances of the Atm
and BackendSystem
types so you can
have data to represent the telemetry sources involved in your trace.
Open the tailtracer/model.go
file and add the following function to it:
By now you have heard and read enough about how traces are made up of Spans. You have probably also written some instrumentation code using the SDK's functions and types available to create them, but what you probably didn't know, is that within the Collector's API, that there are other types of "spans" involved in creating a trace.
You will start with a type called ptrace.ResourceSpans
which represents the
resource and all the operations that it either originated or received while
participating in a trace. You can find its definition within the
/pdata/internal/data/protogen/trace/v1/trace.pb.go.
ptrace.Traces
has a method named ResourceSpans()
which returns an instance
of a helper type called ptrace.ResourceSpansSlice
. The
ptrace.ResourceSpansSlice
type has methods to help you handle the array of
ptrace.ResourceSpans
that will contain as many items as the number of
Resource
entities participating in the request represented by the trace.
ptrace.ResourceSpansSlice
has a method named AppendEmpty()
that adds a new
ptrace.ResourceSpan
to the array and return its reference.
Once you have an instance of a ptrace.ResourceSpan
you will use a method named
Resource()
which will return the instance of the pcommon.Resource
associated
with the ResourceSpan
.
Update the generateTrace()
function with the following changes:
- add a variable named
resourceSpan
to represent theResourceSpan
- add a variable named
atmResource
to represent thepcommon.Resource
associated with theResourceSpan
. - Use the methods mentioned above to initialize both variables respectively.
Here is what the function should look like after you implemented these changes:
检查你的工作
- Added the
resourceSpan
variable and initialized it with theResourceSpan
reference returned by thetraces.ResourceSpans().AppendEmpty()
call- Added the
atmResource
variable and initialized it with thepcommon.Resource
reference returned by theresourceSpan.Resource()
call
通过属性描述资源¶
The Collector's API provides a package named pcommon
(nested under the pdata
package) with all the types and helper functions required to describe a
Resource
.
In the Collector's world, a Resource
is described by attributes in a key/value
pair format represented by the pcommon.Map
type.
You can check the definition of the pcommon.Map
type and the related helper
functions to create attribute values using the supported formats in the
/pdata/pcommon/common.go file within the Otel
Collector's GitHub project.
Key/value pairs provide a lot of flexibility to help model your Resource
data,
so the OTel specification has some guidelines in place to help organize and
minimize the conflicts across all the different types of telemetry generation
entities that it may need to represent.
These guidelines are known as Resource Semantic Conventions and are documented in the OTel specification.
When creating your own attributes to represent your own telemetry generation entities, you should follow the guideline provided by the specification:
Attributes are grouped logically by the type of the concept that they described. Attributes in the same group have a common prefix that ends with a dot. For example all attributes that describe Kubernetes properties start with
k8s.
Let's start by opening the tailtracer/model.go
and adding
go.opentelemetry.io/collector/pdata/pcommon
to the import
clause so you can
have access to the pcommon
package capabilities.
Now go ahead and add a function to read the field values from an Atm
instance
and write them as attributes (grouped by the prefix "atm.") into a
pcommon.Resource
instance. Here is what the function looks like:
检查你的工作
- Declared a variable called
atmAttrs
and initialized it with thepcommon.Map
reference returned by theresource.Attributes()
call- Used the
PutInt()
andPutStr()
methods frompcommon.Map
to add int and string attributes based on the equivalentAtm
field types. Notice that because those attributes are very specific and only represent theAtm
entity, they are all grouped within the "atm." prefix.
The resource semantic conventions also have prescriptive attribute names and well-known values to represent telemetry generation entities that are common and applicable across different domains like compute unit, environment and others.
So, when you look at the BackendSystem
entity, it has fields representing
OS related information and
Cloud related
information, and we will use the attribute names and values prescribed by the
resource semantic convention to represent that information on its Resource
.
All the resource semantic convention attribute names and well known-values are kept within the /semconv/v1.9.0/generated_resource.go file within the Collector's GitHub project.
Let's create a function to read the field values from an BackendSystem
instance and write them as attributes into a pcommon.Resource
instance. Open
the tailtracer/model.go
file and add the following function:
Notice that I didn't add an attribute named "atm.name" or "backendsystem.name"
to the pcommon.Resource
representing the Atm
and BackendSystem
entity
names, that's because most (not to say all) distributed tracing backend systems
that are compatible with the OTel trace specification, interpret the
pcommon.Resource
described in a trace as a Service
, therefore they expect
the pcommon.Resource
to carry a required attribute named service.name
as
prescribed by the resource semantic convention.
We will also use non-required attribute named service.version
to represent the
version information for both Atm
and BackendSystem
entities.
Here is what the tailtracer/model.go
file looks like after adding the code for
properly assign the "service." group attributes:
model.go
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 |
|
检查你的工作
- Imported the
go.opentelemetry.io/collector/semconv/v1.9.0
package asconventions
, in order to have access to all resource semantic conventions attribute names and values.- Updated the
fillResourceWithAtm()
function by adding lines to properly assign the "service.name" and "service.version" attributes to thepcommon.Resource
representing theAtm
entity- Updated the
fillResourceWithBackendSystem()
function by adding lines to properly assign the "service.name" and "service.version" attributes to thepcommon.Resource
representing theBackendSystem
entity- Updated the
generateTraces
function by adding lines to properly instantiate apcommon.Resource
and fill in the attribute information for bothAtm
andBackendSystem
entities using thefillResourceWithAtm()
andfillResourceWithBackendSystem()
functions
用 spans 表示操作¶
You now have a ResourceSpan
instance with their respective Resource
properly
filled with attributes to represent the Atm
and BackendSystem
entities, you
are ready to represent the operations that each Resource
execute as part of a
trace within the ResourceSpan
.
In the OTel world, in order for a system to generate telemetry, it needs to be instrumented either manually or automatically through an instrumentation library.
The instrumentation libraries are responsible to set the scope (also known as the instrumentation scope) in which the operations participating on a trace happened and then describe those operations as spans within the context of the trace.
pdata.ResourceSpans
has a method named ScopeSpans()
which returns an
instance of a helper type called ptrace.ScopeSpansSlice
. The
ptrace.ScopeSpansSlice
type has methods to help you handle the array of
ptrace.ScopeSpans
that will contain as many items as the number of
ptrace.ScopeSpan
representing the different instrumentation scopes and the
spans it generated within the context of a trace.
ptrace.ScopeSpansSlice
has a method named AppendEmpty()
that adds a new
ptrace.ScopeSpans
to the array and return its reference.
Let's create a function to instantiate a ptrace.ScopeSpans
representing for
the ATM system's instrumentation scope and its spans. Open the
tailtracer/model.go
file and add the following function:
The ptrace.ScopeSpans
has a method named Scope()
that returns a reference
for the pcommon.InstrumentationScope
instance representing the instrumentation
scope that generated the spans.
pcommon.InstrumentationScope
has the following methods to describe an
instrumentation scope:
-
SetName(v string)
sets the name for the instrumentation library -
SetVersion(v string)
sets the version for the instrumentation library -
Name() string
returns the name associated with the instrumentation library -
Version() string
returns the version associated with the instrumentation library
Let's update the appendAtmSystemInstrScopeSpans
function so we can set the
name and version of the instrumentation scope for the new ptrace.ScopeSpans
.
Here is what appendAtmSystemInstrScopeSpans
looks like after the update:
You can now update the generateTraces
function and add variables to represent
the instrumentation scope used by both Atm
and BackendSystem
entities by
initializing them with the appendAtmSystemInstrScopeSpans()
. Here is what
generateTraces()
looks like after the update:
At this point, you have everything needed to represent the telemetry generation entities in your system and the instrumentation scope that is responsible to identify operations and generate the traces for the system. The next step is to finally create the spans representing the operations that the given instrumentation scope generated as part of a trace.
ptrace.ScopeSpans
has a method named Spans()
which returns an instance of a
helper type called ptrace.SpanSlice
. The ptrace.SpanSlice
type has methods
to help you handle the array of ptrace.Span
that will contain as many items as
the number of operations the instrumentation scope was able to identify and
describe as part of the trace.
ptrace.SpanSlice
has a method named AppendEmpty()
that adds a new
ptrace.Span
to the array and return its reference.
ptrace.Span
has the following methods to describe an operation:
-
SetTraceID(v pcommon.TraceID)
sets thepcommon.TraceID
uniquely identifying the trace which this span is associated with -
SetSpanID(v pcommon.SpanID)
sets thepcommon.SpanID
uniquely identifying this span within the context of the trace it is associated with -
SetParentSpanID(v pcommon.SpanID)
setspcommon.SpanID
for the parent span/operation in case the operation represented by this span is executed as part of the parent (nested) -
SetName(v string)
sets the name of the operation for the span -
SetKind(v ptrace.SpanKind)
setsptrace.SpanKind
defining what kind of operation the span represents. -
SetStartTimestamp(v pcommon.Timestamp)
sets thepcommon.Timestamp
representing the date and time when the operation represented by the span has started -
SetEndTimestamp(v pcommon.Timestamp)
sets thepcommon.Timestamp
representing the date and time when the operation represented by the span has ended
As you can see per the methods above, a ptrace.Span
is uniquely identified by
2 required IDs; their own unique ID represented by the pcommon.SpanID
type and
the ID of the trace they are associated with represented by a pcommon.TraceID
type.
The pcommon.TraceID
has to carry a globally unique ID represented through a 16
byte array and should follow the
W3C Trace Context specification
while the pcommon.SpanID
is a unique ID within the context of the trace they
are associated with and it's represented through a 8 byte array.
The pcommon
package provides the following types to generate the span's IDs:
-
type TraceID [16]byte
-
type SpanID [8]byte
For this tutorial, you will be creating the IDs using functions from
github.com/google/uuid
package for the pcommon.TraceID
and functions from
the crypto/rand
package to randomly generate the pcommon.SpanID
. Open the
tailtracer/model.go
file and add both packages to the import
statement;
after that, add the following functions to help generate both IDs:
Now that you have a way to properly identify the spans, you can start creating them to represent the operations within and across the entities in your system.
As part of the generateBackendSystem()
function, we have randomly assigned the
operations that the BackEndSystem
entity can provide as services to the
system. We will now open the tailtracer/model.go
file and a function called
appendTraceSpans()
that will be responsible to create a trace and append spans
representing the BackendSystem
operations. Here is what the initial
implementation for the appendTraceSpans()
function looks like:
检查你的工作
- Added
traceId
andbackendSpanId
variables to respectively represent the trace and the span id and initialized them with the helper functions created previously- Added
backendSpanStartTime
andbackendSpanFinishTime
to represent the start and the end time of the operation. For the tutorial, anyBackendSystem
operation will take 1 second.- Added a variable called
backendSpan
which will hold the instance of theptrace.Span
representing this operation.- Setting the
Name
of the span with theEndpoint
field value from theBackendSystem
instance- Setting the
Kind
of the span asptrace.SpanKindServer
. Take a look at SpanKind section within the trace specification to understand how to properly define SpanKind.- Used all the methods mentioned before to fill the
ptrace.Span
with the proper values to represent theBackendSystem
operation
You probably noticed that there are 2 references to ptrace.ScopeSpans
as
parameters in the appendTraceSpans()
function, but we only used one of them.
Don't worry about it for now, we will get back to it later.
You will now update the generateTraces()
function so it can actually generate
the trace by calling the appendTraceSpans()
function. Here is what the updated
generateTraces()
function looks like:
You now have the BackendSystem
entity and its operations represented in spans
within a proper trace context! All you need to do is to push the generated trace
through the pipeline so the next consumer (either a processor or an exporter)
can receive and process it.
consumer.Traces
has a method called ConsumeTraces()
which is responsible to
push the generated traces to the next consumer in the pipeline. All you need to
do now is to update the Start()
method within the tailtracerReceiver
type
and add the code to use it.
Open the tailtracer/trace-receiver.go
file and update the Start()
method as
follow:
检查你的工作
- Added a line under the
case <=ticker.C
condition calling thetailtracerRcvr.nextConsumer.ConsumeTraces()
method passing the new context created within theStart()
method (ctx
) and a call to thegenerateTraces
function so the generated traces can be pushed to the next consumer in the pipeline
If you run your otelcol-dev
, here is what the output should look like after 2
minutes running:
Here is what the generated trace looks like in Jaeger:
What you currently see in Jaeger is the representation of a service that is
receiving a request from an external entity that isn't instrumented by an OTel
SDK, therefore it can't be identified as the origin/start of the trace. In order
for a ptrace.Span
to understand it is representing an operation that was
execute as a result of another operation originated either within or outside
(nested/child) of the Resource
within the same trace context you will need to:
- Set the same trace context as the caller operation by calling the
SetTraceID()
method and passing thepcommon.TraceID
of the parent/callerptrace.Span
as a parameter. - Define who is the caller operation within the context of the trace by calling
SetParentId()
method and passing thepcommon.SpanID
of the parent/callerptrace.Span
as a parameter.
You will now create a ptrace.Span
representing the Atm
entity operations and
set it as the parent for BackendSystem
span. Open the tailtracer/model.go
file and update the appendTraceSpans()
function as follow:
Go ahead and run your otelcol-dev
again and after 2 minutes running, you
should start seeing traces in Jaeger like the following:
We now have services representing both the Atm
and the BackendSystem
telemetry generation entities in our system and can fully understand how both
entities are being used and contributing to the performance of an operation
executed by an user.
Here is the detailed view of one of those traces in Jaeger:
That's it! You have now reached the end of this tutorial and successfully implemented a trace receiver, congratulations!